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Semidirect sum of the functions Fj : Xj ^ R u {+ 00 } defined on sub¬ 
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\y\i ■= Va? + i = l-..n. 

Gradient operator (the continuous one or a discrete one) 

Sub differential of the function $ at a; 

(Fenchel) conjugate function of $ 

Closure of the function $ 
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^s{x) = < . 

I 00 otherwise 

Graph of the function g 
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Summary 

Many tasks in image processing can be tackled by modeling an appropriate data fidelity 
term $ : —>■ M u {+00} and then solve one of the regularized minimization problems 

argmin {<h(a;) s.t. < r} 

xsR" 

(-P 2 ,a) argmin{$(a:) + A\['(x)}, A > 0 

xeR" 

with some function \I/ : R” —>■ R u {+00} and a good choice of the parameter(s). Two tasks 
arise naturally here: 

i) Study the solver sets SOL(Pi,-) and SOL(P 2 ,a) of the minimization problems. 

ii) Ensure that the minimization problems have solutions. 

This thesis provides contributions to both tasks: Regarding the first task for a more special 
setting we prove that there are intervals (0, c) and (0, d) such that the setvalued curves 

r ^ SOL(Pi,,), r e (0, c) 

A t—>■ SOL(P2 ,a)) a e (0, d) 

are the same, besides an order reversing parameter change g : (0,c) —>■ (0,d). Moreover 
we show that the solver sets are changing all the time while r runs from 0 to c and A runs 
from d to 0. 

In the presence of lower semicontinuity the second task is done if we have additionally 
coercivity. We regard lower semicontinuity and coercivity from a topological point of view 
and develop a new technique for proving lower semicontinuity plus coercivity. The key 
point is that a function / : R” —>■ [—00, +00] is lower semicontinuous and coercive, iff a 
certain continuation of / to the one point compactification of R*^ is continuous with respect 
to the right order topology on [—00, +00]. 

Dropping any lower semicontinuity assumption we also prove a theorem on the coercivity 
of a sum of functions. More precisely, this theorem gives information on which subspaces 
of R" a sum F + G of functions F, G : R'^ —> [—00, +00] is coercive, provided that F and 
G are of a certain form, namely 

F = Fi kti F 2 and G = Giw G 2 

with functions Fi : W ^ P 2 : W 2 —» R u {+00}, Gi : Fi —» R u {+00}, 

and G 2 : F 2 ^ ® u {+00}, where 

R’^ = Xi©X2 = Fi©F2. 

For such functions the theorem basically states that F + G is coercive on Xi + Yi = 
{X2 n F 2 )''‘ if -L W 2 , Fi T F 2 and certain boundedness conditions hold true. 



Zusammenfassung 

Viele Aufgaben in der Bildverarbeitung lassen sich wie folgt angehen; Nach Modellierung 
eines Datenterms $ lost man eines der folgenden regularisierten Mini- 

mierungsprobleme 


(Pi,r) argmin {d>(a;) s.t. \l/(x) < r} 

a;GR" 

(-P 2 ,a) argmin{$(a:) + A\l/(x)}, A > 0 

xeR" 

mit einer Funktion 'k : R” —> R u {+ 00 } und jeweils gut gewahltem Parameterwert. Es 
stellen sich unter anderem folgende Aufgaben; 

i) Untersuche die Losungsmengen SOL(Pi,-) und SOL(P 2 ^a) der Minimierungsprobleme. 

ii) Stelle sicher, dah die Minimierungsprobleme iiberhaupt Losungen besitzen. 

Diese Arbeit enthalt Beitrage zu beiden Aufgaben; Beziiglich der ersten Aufgabe wird (in 
einem spezielleren Rahmen) die Existenz von Intervallen (0, c) und (0, d) bewiesen derart, 
dab die mengenwertigen Kurven 


r ^ SOL(Pi,,), r e (0, c) 

A^SOL(P2,a), Ae (0,d) 

die selben sind, bis auf einen ordnungsumkehrenden Parameterwechsel g : (0, c) ^ {0,d). 
Desweiteren zeigen wir, dak die Lbsungsmengen SOL(Pi,-) bzw. SOL(P 2 ^a) sich die ganze 
Zeit andern, wahrend r aufsteigend das Intervall (0, c) durchlauft bzw. A absteigend das 
Intervall (0, d) durchlauft. 

Falls Halbstetigkeit von unten gegeben ist, ist die zweite Aufgabe gelost, wenn zusatzlich 
Koerzivitat vorliegt. 

Wir betrachten in dieser Arbeit sowohl Halbstetigkeit von unten als auch Koerzivitat von 
einem topologischen Standpunkt. Grundlegend ist hierbei, dab eine Funktion / ; R'^ —>■ 
[— 00 , + 00 ] genau dann halbstetig von unten und koerziv ist, wenn eine gewisse Fortsetzung 
von / auf die Einpunktkompaktihzierung von R" stetig bzgl. der von den Halbstrahlen 
(a, + 00 ], a E [— 00 , + 00 ) erzeugten Topologie ist. Hieraus wird eine neue Beweistechnik ftir 
den gemeinsamen Nachweis von Halbstetigkeit von unten und Koerzivitat entwickelt. 



Desweiteren beweisen wir einen Satz iiber die Koerzivitat der Summe zweier Funktionen, 
ohne Halbstetigkeit von unten vorauszusetzen. Genauer gesagt liefert dieser Satz Infor- 
mationen dariiber auf welchen Unterraumen des R” die Summe F + G von Funktionen 
F, G : R” —>■ [— 00, +oo] koerziv ist, wenn diese Funktionen von der Bauart 

F = Fi kti F 2 and G = Gi w G 2 

sind mit Funktionen Fi : Xi -> R u {+00}, F2 : X2 ^ R u {+00}, Gi : Fi —> R u {+00}, 
und 6*2 : F 2 —^ R worin 

R” = Xi©X2 = Fi©F2. 

Ftir Funktionen solchen Typs besagt der Satz im Wesentlichen, dak F + G genau dann 
koerziv auf dem Unterraum Xi + Yj = {X 2 n ¥ 2 )'^ ist, wenn Xi _L X 2 , Fi _L 12 und gewisse 
Beschranktheitsvoraussetzungen erfiillt sind. 
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1.1 Definitions, notations and conventions 

Writing A B means that A is a subset of B, whereas writing A cz B indicates that A 
is a proper subset of B. A function f : X Y is genuine or non-trivial, iff X (and 
therefore also Y) is nonempty. 

A (direct) decomposition of a vector space V into subspaces Vi, 14... 14 is a tupel 
(Id, V 2 ,... Vn) of subspaces, such that every v e V can be written in a unique way in the 
form V = Vi + V 2 + • ■ ■ + Vn with Vi eVi for i = 1.. .n. A bit sloppily but practically we will 
also write V = Vi © V 2 © ■ ■ ■ © 14 and call this a (direct) decomposition or direct sum. For 
a given subspace Ui of V & subspace U 2 is called complementary to Ui iff V = Ui®U 2 . 

The set of all n-tuples of real numbers is denoted by W^, where n e Mq. Note that 
containing only the empty tupel, is the trivial real vector space. By Ci, 62 , • • ■ , e„ we name 
the vectors (1, 0, 0,... 0)^, (0,1,0,..., 0)^,... (0,... 0,1)^, which form the standard basis 
of The trivial linear mapping V ^ M, x 0 between a real vector space X and the 
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real numbers will be denoted by Ox- The nullspace (kernel) of a matrix/linear operator 
A is denoted by Af{A) and its range by TZ{A). The transpose of a matrix A is denoted 
by A*. For Euclidean vectors v we will also write v'^. For a vector y = (a, e let 
\y\ denote the vector in M"" whose components are 's/aj + bj —■ \y\i, i = 1.. .n. Usually y 
appears in the form y = Vx with a linear mapping V : M” —>■ M” x M” modeling a discrete 
gradient. 

We also remark that, in the presence of a direct decomposition of M” into subspaces like 
= Xi 0 X 2 © W 3 , we will use the unique decomposition a; = Xi + X 2 + 0:3 of x e 
in its components Xi e Xi,a ;2 e ^ 2 , 0:3 e X 3 without emphasizing the underlying direct 
decomposition every time. Furthermore we will use the notation S = Fi © 5*2 © ■ ■ ■ © Ffc for 
subsets S,Si,...,SkofMA iff every s e S has a unique decomposition s = si + S 2 + • • • + 
into components Sj s Sj, j e {1,..., k}. For convex subsets Ci, U 2 of R” we have C 1 + C 2 = 
Cl © C 2 iff aff(Ui) + aff(U 2 ) = aff(C'i) © aff(C' 2 ), see Theorem B.ll for more details. 

The convex hull of a set S' c is denoted by co(S'). The affine hull of a set S' c 
is named by aff(S'). The (topological) closure and the interior of a set S' c R” will be 
denoted by S and int(S'), respectively. Note that, for any subset A c R"-^ the identity 

_ Q _ 

A = A holds for all B ^ A that are closed subsets of R'^; in particular it does not matter 
whether we form the closure of a subset A of R'^ with respect to R'^ or with respect to 
any affine supperset of A, including aff(yl). The relative interior of a convex set C 
will be denoted by rifU). The relative boundary of a convex set C will be denoted by 
rb(C') := U\ri(C'). 

For a totally ordered set {Z,^) we set 

MAX<c(Z) •■= {z E Z : z is a maximum of Z} 

If it is clear from the context which total order is given to Z we will shortly also write 
MAX(Z). If {Z, <) has a maximum S then MAX<j(Z) = {£}. If {Z, <) has no maximum 
then MAX^(Z) = 0. 

Let Rg := [0, + 00 ) and let Fo(R'^) denote the set of proper, convex, closed functions 
mapping R” into the extended real numbers R u {+ 00 }. For nonempty, affine subsets 
X c R"-^ we define Fo(A) in an analogous way. The closure of a convex function / : 
R"" —» R u {— 00 , + 00 } is denoted by cl/. The closure of a proper convex function is its 
lower semicontinuous hull. See Theorem B.3 for some of the properties of the closure 
operator. For a given function d' : A —^ Z between a set X and a totally ordered set 
(Z, <) we distinguish different types of level sets by the following notations; 

lev^-d^ := leVs^^rT ■= {x e X ■. \h(x) < r] and lev<T-T ■= {x e X ■. 'l/(x) < r}. 

Usually the term “level set” refers to the first type with ‘W”. 

Important lower level sets are the closed balls B,,(a)[[| • j|] := {x e R” : ||x|| < r} of radius 
r e [0, + 00 ), midpoint a e R” with respect to a norm || • ||. If it is clear from the context 
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which norm is meant we use the abbreviation Br(a). If a = 0 we even more shortly 
write B,.. For spheres §r.(a)[|| ■ ||] := {x e R” : ||x|| = r} and open balls Br(a)[|| ■ ||] := 
{x E R” : ||a;|| < r} with midpoint a, radius r e [ 0 , +oo) and r e ( 0 , +oo), respectively, 
we apply similar abbreviations. If more general a metric space {X, d) is given we use 
the notations B/j (a) ■= {x e X : d{x,a) < i?},B/{(a) ■= {x e X ■. d{x,a) < R} and 
S_R(a) ■= {x E X : d{x, a) = R} for the open ball, closed ball and sphere of radius R eM. 
around a e X, respectively. If X = R” is endowed with the usual Euclidean metric we also 
will use the notations B^^(a) := {x e R"- : ||a; — a|| < i?},B^\a) := {x e R”^ : ||x — a|| < R} 
and ^\cl) '■= {x e R'^ : ||a; — a|| = R}. If the dimension n of the underlying Euclidean 
space is clear from the context we also use the abbreviations B^(a),Bj:j(a) and ^^(a). If 
a = 0 and/or r = 1 we sometimes omit the corresponding parts of the notations and write 
e.g. S^, S(a), § or B. 

Further important level sets are half-spaces and hyperplanes. We use the notations •■= 
{x e R'^ : (p, x) < a}, ^ ■■= {x e R” : (p, x) > a} and ^ ■■= {x e R"^ : (p, x) = a} for 

the closed halfspaces, the open halfspaces and hyperplanes, respectively. 

The set of overlapping parameters between a set A and a family {Br)TeT of sets Br 
with some index set T is OP{A, {Br)TeT) ■= {r e T : A Ct Br 0}. In this thesis we will 
consider the case A = dom $ and Br = levT-T, r e R for functions <h, T : R” ^ R u {+oo} 
and use the notation 

0P(<1>, T) := OP(dom<l>, (lev.r^)T£R) = {r e R : dom<h n lev^T =|= 0}. 


Furthermore, the indicator function is of a set S is defined by 

. ^ _ I 0 if a; e S', 
i'S0) t otherwise. 


For xq e R"- the subdifferential d\h(a;o) of T at Xq is the set 

d\h(xo) := {p 6 R” : T(a;o) -I- (p, a; — Xq) < ^(a;) for all x e R”}. 

If T is proper, convex and xo e ri(domT), then d\['(a;o) =H 0- 

Additionally we will need the Fenchel conjugate function of T dehned by 

'F*(p) := sup{(p, a:) — T(a:)}. 

xsR" 


Finally the graph of a function g is denoted by gr g. 

Topological notations and notions 

Definition 1.1.1. IFe say that a topological space {X, O) is nonempty, iffX is nonempty. 
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Definition 1.1.2. Let U he a subset of a set X and let O be a system of subsets of X. 
Then we denote the system 

{U nO:OeO} 

abbreviated by U (L\0. 

If (9 is a topology on X then If ifTl is a topology on U; cf. also Subsection 2.3.1. 

Definition 1.1.3. An open neighborhood of a point x in a topologieal spaee {X,0) is 
just a subset O s O that eontains x. 

A neighborhood of a point x from a topologieal spaee {X, O) is just a subset U ^ X 
eontaining an open neighborhood of x. 

The system of all neighborhoods of x will be denoted by U\0]{x) or, if the underlying 
topologieal spaee is elear from the eontext, simply also by U{x). 

A system B{x) of open subsets of X is ealled an O-neighborhood basis of a point x e X, 
iff every neighborhood U s U{x) eontains some B e B{x). 

We will feel free to adopt our notations for neighborhood systems according to the notations 
for the underlying topological space. For instance in the context of a topological space 
{Xj O') we usually write U'{x') instead of U{x'). 

Remark 1.1.4. Having a neighborhood basis B{x) for every point x of a topologieal space 
{X,0) we can first reconstruct all neighborhood systems L({x), x e X, and then also the 
whole topology by means of the formulas 

U{x) = {U ^ X \3B E B{x) : U ^ B} and O = {O ^ X \yx e O : O e U{x)}. 

See [27, 2.9 SatzJ and its proof for more details. 

Regarding the following definition we note that “limit point” is really meant as limit point 
and not as accumulation point. 

Definition 1.1.5. A seguence in a topological space {X,Ox) is said to have an 

element x e X as limit point iff every neighborhood of x contains almost all seguence 
members, i.e. - more formally expressed - iff 

yu E U{x) 3 NeN yn^ N -.Xn^U 

holds true. The set of all limit points will be denoted by (!lx-fini„^+oo Xn or simply by 
fim„^+oo Xn, if it is clear which topology is given to X. If the seguence has at last one limit 
point we call the seguence convergent. 

Definition 1.1.6. A topological space (X,0) is called a Hausdorff space iff any two 
distinct points have two disjoint open neighborhoods, i.e. for every pair of distinct point 
xi,X 2 ^ X there are open disjoint sets Oi, O 2 e with xi e Oi and X 2 e O 2 . 
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Definition 1.1.7. A topological space {X, O) is called compact if every covering of X by 
sets from O has a finite subcover. 

If the topological space appears as a subspace of another space, see Subsection 2.3.1, the 
following equivalent definition can also be used; 

Definition 1.1.8. Let (X,0) be a topological space. A subspace {X,X (fil O) is called 
compact if every open covering of X with open sets from O has a finite subcover. 

Remark 1.1.9. In some texts the word “compact” is only used for spaces that are in 
addition Hausdorff spaces. 

Definition 1.1.10. Let (X, O) be a topological space. We say that K ^ X is a compact 
snbset of {X,0), iff {K,K (fil O) is a compact space. We denote the system {K c X : 
K is a compact subset of {X, O)} by K{X, O) or sometimes only by K(,X), if it is clear 
which topology is given to X. 

Similarly we denote the system of closed subsets of {X, O) by O) or by A(X) or 

even only by A.. Finally the system of compact and closed subsets of(X, O) will be denoted 

by KAiX, O) or by KA{X). 

Note that ICA{X, O) = IC{X, O) n A{X, O) c IC{X, O) can be a strict subset of /C(X, O), 
cf. Example 2.5.7. 

The following definition is taken from [15, p. 146]. 

Definition 1.1.11. A topological space is locally compact, iff each point has at least one 
compact neighborhood. 

Example 1.1.12. The Euclidean space M”, endowed with the natural topology, is not 
compact, but locally compact, since ]Bi(a;) is a compact neighborhood for an arbitrary point 

X e ML. 

Cf. Remark 2.2.3 for the following definition. 

Definition 1.1.13. A function f : {X,0) —* {X',0') between topological spaces (X,0) 
and {X', O') is called continuous in Xq iff for all open neighborhoods e O' of 

/(xo) there is an open neighborhood Ox^ ^ O of xq with flOx^] ^ ^'f{xo) ■s®?/ 

Oxf, ^ f [^'f{xo)^)- f continuous if f is continuous in all points x s X, i.e. if 

for all open sets O' e O' the pre-image O ■= f^[0'] is an open set from O. 

For the next two definitions cf. e.g. [15, p. 90] and [15, p. 94]. 

Definition 1.1.14. A mapping g : (Y,Oy) {Z,Oz) between topological spaces is called 
open iff every open subset of {Y, Oy) is mapped by g to an open subset of {Z, Oz)- Analo¬ 
gously g is called closed iff every closed subset of (Y, Oy) is mapped by g to a closed subset 
ofiZ,Oz). 
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1. Introduction and overview 


Note that a bijective mapping is open, respectively closed, iff its inverse mapping is con¬ 
tinuous. 


1.2 Motivation from image processing 

Many tasks in image processing such as deblurring, inpainting, removal of different kinds 
of noise or reconstruction of a sparse signal can be tackled by minimizing a (parameter 
containing) function, designed for the respective purpose. Often this function can be 
written as a weighted sum 


$ + Ad' 


of two functions $, d/ e ro(M"), where d> serves as data hdelity term and fh as regularization 
term which influence is controlled by the parameter A. At this point vectors x e model 
gray value images, where n = rixfiy is the total number of pixels. 

Both the family of penalized problems 


argmin(d)(a;) -I- Ad/(a;)) 

xsK" 


and the related families of constrained problems 
argmin(d>(a;) s.t. d/(a;) < r) 


argmin(d>(a;) -h Pev,vi>), 

xsK" 


argmin(d/(a;) s.t. d>(a;) ^ a) 

xsR" 


argmin(d'(x) Pev,<i.) 

xsR" 


(for certain parameter ranges) are considered in the literature. Some examples are: 


• The family of penalized problems 

argmin {^Ax — h\\ + A||x||i), 

xsR" 

along with the families of constraint problems 

argmin (||Aa: — h \2 s.t. ||a;||i < r) 

xeR" 

argmin {\\Ax - b \\2 + dev,(|M|i)(2^)) 

xgR" 


argmin (||a;||i (\\A-b\\ 2 ){x)), 

xeR" 

(Basis pursuit denoising), cf. e.g. [25], [16], [26], [7]. 


argmin (||Aa; — s.t. ||a;||i < r) 


xsR" 


(LASSO problem) and 


argmin (||a;||i s.t. \\Ax — b \\2 < Va) 

xeR" 
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1.2 Motivation from image processing 


The family of penalized problems 

argmin {\\Ax — b\\l + A|||Va;||| ), 

along with the families of constraint problems 

argmin {\\Ax — b\\l s.t. |||Va:|||^ < r) argmin {\\Ax — b \\2 + iiev^(|||v-|||i)(a^)) 


xgR" 


xeR" 


and 


argmin ( 11 Vx 11 s.t. \\Ax - b \\2 ^ Va) argmin (|| |Va;| || + iiev/ 3 :(||A-fe|| 2 )(a^)), 

— ^TTBn ^ 


xsR" 


xeR" 


cf. e.g. [18], [30], [29]. 

The family of penalized problems 


argmin - bklog{[Ax]k)) +A|||Va;||| 


xeR' 


k=l 




along with the families of constraint problems 

argmin ($(a;) s.t. || |Va;|||i < r) argmin (<l>(a;) + <.iev,(|||v-|||i)(a^)) 


xeR" 


xeR" 


and 


argmin (IIIVxl 111 s.t. $(a:) < a) argmin (|||Va:|||i + tiev<,($(.))(a;)), 

xeR" xeR" 


cf. e.g. [9], [23], [5]. 


All this minimization problems are of the form 

argmin(F + G,,) ( 1 . 1 ) 

with functions F,G e ro(M'^) and some regularization parameter r]] for 77 A 0 the function 
Gr^ is often of the form 

G,(.) = G{pL-) 

with a matrix L e and a norm G(-) = || • || on R™' in the penalized cases and the 
indicator function G = iieviG in the constraint cases, respectively. 


7 




1. Introduction and overview 


Two questions arise naturally: How can a good regularization parameter be chosen? How 
can argmin(F + 7 ^ 0 be ensured? Regarding the first question for penalized problems 

argmin {F{x) + A||Lx||) 

there are for instance methods from statistics for choosing a value for A, cf. [28], [1], [11]. 
However, in cases where we have knowledge about the original image Xorig, say in the sense 
of knowing a good upper bound for ||Lxorig||, we can use this upper bound as value for the 
regularization parameter in the constrained problem 

argmin(F(x) s.t. ||T(a;)|| ^ r). 

If we have knowledge about the noise level, say in the sense of knowing approximately 
^(xorig), we can similar choose this approximate value in the constrained problem 

argmin(||La;|| s.t. F{x) < a). 

xeM." 

But even if we had chosen a good parameter r, resp. a, the questions remains how we can 
find a corresponding value for A. 

Regarding the second question it is well known that the lower semicontinous function 
F + Grj —■ has a minimizer if it is coercive, i.e. fulhlls Hjj{x) +00 as ||a;|| —>■ 00 . Often 
it is possible to prove coercivity of 11^ by hand. Since this can be laboriously it would be 
good to have some easy tools which ensure coercivity of such a sum. 

This thesis provides contributions to both the question on how to find for given r a corre¬ 
sponding value A and performs also coercivity investigations. 


1.3 Contributions and a useful inequality 

1.3.1 A method for proving coercivity and lower semicontinuity 

As already mentioned coercivity is a usefull property for proving the existence of a mini¬ 
mizer. The defining condition H{x) —>■ -l-oo as ||a;|| —>■ -l-oo looks somewhat like a continuity 
condition. 

As we will see in Theorem 2.5.16 a lower semicontinous function : M” —>■ [— 00 ,- 1 - 00 ] 
is indeed coercive iff a certain extension H : X [— 00 ,- 1 - 00 ] to a compact topological 
superspace of =: X is continuous with respect to a certain topology 7^ on [— 00 , -l-oo], 
making the latter to a compact space as well. This equivalence between the lower semi¬ 
continuity plus coercivity of the mapping H and the existence of such a certain compact 
continuation H leads to a - as far as the author knows - new technique of proving lower 
semicontinuity plus coercivity. The rough idea is as follows: Assume we know that a func¬ 
tion g : —>■ [— 00 ,- 1 - 00 ] can be written as, say, composition g = g 2 0 gi oi easier functions 




1.3 Contributions and a useful inequality 


gi : R” Y, g 2 : Y ^ [—oo,+oo], where Y is some topological space, such that each 
of them allows a compact continuation gi : X —* Y and g 2 : Y ^ [— oo,+oo]. Under 
certain conditions then also the existence of the needed compact continuation g of g can 
be concluded. The needed compact continuation g is simply obtained if we can directly 
form the concatenation ^2 ° be- if b" = Y. Also if idy allows a compact continuation 
idy : U —>■ U we are done after setting g = g 2 °idY°gi. More surprising and more important 
is the fact that the needed compact continuation g also exists (under certain conditions) 
if the mapping idy allows a compact continuation idy ■. Y ^ Y, cf. Theorem 2.6.2 and 
Theorem 2.6.5. Although the developed theory is quite rudimentary it is already strong 
enough to easily prove for example the following often applied result in image restoration 
which was indeed the starting point of my work. 

Assume that the following mappings are given: 

i) Two matrices / linear mappings H : R” —>■ R'^, K : R" —>■ R® with 

U{H)nU{K) = { 0 }. 

ii) Two proper, lower semicontinuous and coercive mappings (p : [— 00 ,+ 00 ], 

: R® —> [— 00 , + 00 ]. 

Then the mapping /i : R” —>■ [— 00 , + 00 ], given by 

X t—> (pi^Hx) + fj^Kx) 

is lower semicontinuous and coercive. In particular the mapping h takes his infimum 
inf h E [— 00 , + 00 ] at some point in R". 

The corresponding proof can be found in Section 2.7. 

1.3.2 Properties of lower semicontinuous mappings from a 
topological viewpoint 

In the previous section we have mentioned the topology T for [— 00 ,+ 00 ]. More precise 
this is the right order topology which is induced by the natural order on [— 00 , + 00 ]. This 
is the natural topology for studying lower semicontinuity, since a function R” —>■ [— 00 , + 00 ] 
is lower semicontinuous iff it is continuous with respect to the topology T on [— 00 , + 00 ]. 
After investigating some properties of the topological space ([— 00 , + 00 ], T) we will see in 
Subsection 2.5.3 that some well known (and easy to prove) properties of lower semicontinous 
functions are just special cases of common theorems from topology. For instance the general 
statement 

“The concatenation g o f of continuous mappings f, g is again continuous. ” 
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becomes in this context the property 

“The concatenation g o f of a continuous mapping f with a 
lower semicontinuous mapping g is again lower semicontinous.” 

In the same way we can also regard the fact that a lower semicontinuous function / takes 
its infimum on every compact set: The general statement 

“A continuous function maps compact sets onto compact sets” 

reads in our context 


“A lower semicontinous function maps compact sets 
on sets which contain their infimum. ” 

1.3.3 Coercivity of a sum of functions 

Theorem 3.3.6 can be used as an easy to apply tool for investigating coercivity of a sum 
of functions. More precisely, this theorem gives information on which subspaces of M"" a 
sum F + G of functions F,G : MA ^ +oo] is coercive, provided that F and G are of 
a certain form, namely 

F = Fi kk) F 2 and G = Giky G 2 

with functions Fi : Xi ^ M u {+ 00 }, F 2 : X 2 M u {+ 00 }, Gi : Fi —> M u {+ 00 }, 
and (^2 : F 2 —^ where 

© X2 = Fi © F2. 

For such functions the theorem basically states that F + G is coercive on Xi + Yi = 
{X 2 n ¥ 2 )'^ if Xi T X 2 , Yi T Y 2 and certain boundedness conditions hold true. 

If the conditions Xi F X 2 , Yi 1. Y 2 are not fulhlled there is no guarantee that F + G is 
coercive on Xj + Yi. But at least F + G is then still coercive on all those subspaces Zi of 
M” that are complementary to Z 2 ■= X 2 n ^ 2 - 

1.3.4 Relation between the constrained and unconstained 
problems for a rather general setting 

In [5] Ciak et al. considered for an underlying orthogonal decomposition MA = Xi@ X 2 of 
the primal minimizations problems 

argmin{<h(x) s.t. ||La;|| ^ r} 

(T 2 ,a) argmin{<h(x) + A||La;||} 

a;sR” 
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along with the dual problems 

aYgmm{^*{-L*p) + r||p||4, 

(D 2 ,a) argmin{<l>*(-L*p) s.t. ||p||* ^ A}. 

PgR”! 

The function $ there has the special form 

$(3;) = <h(a;i + X2) = (j){xi), 

where 0 : Afi ^ R u {+ 00 } is a function fulhlling some properties. 

In this thesis we extend this setting by allowing a third component in the orthogonal 
decomposition of R'^ = Xi® X 2 ® and demand 

I +00 if 3:3 + 0. 

This extension can become interesting when dealing with data in a high dimensional real 
vector space if the data is actually contained in a lower dimensional subspace. Moreover, 
this extended form has the advantage that a symmetry between $ and is recognizable 
much better in this extended setting as we shall see in Lemma 4.4.1. 

1.3.5 A simple but useful equality 

Here we want to mention Lemma A.2 from the appendix along with its preceding vivid 
explanation. The simple but helpful inequality presented in that lemma is 

II hi II ^ C*||hi + / 12 1 

for all hi and /12 in subspaces Xi,X 2 of R*^ with trivial intersection. Originally this in¬ 
equality was made and proved in the context of Lemma 4.3.11, in which proof it was twice 
used for showing differentiability. However it turned out that using this inequality also 
simplifies the boundedness proof in [5, Lemma 3.1 (i)] as done in the proof of part ii) of 
Lemma 4.3.18. Moreover this inequality was helpful in showing convergence of a sequence 
which appeared in the proof of Lemma B.13. 

1.4 Overview 

This thesis consists of three parts, organized in Chapters 2, 3 and 4. In the first part we 
develop a theory giving rise to a - as far as the author knows - new technique of proving 
lower semicontinuity plus coercivity of functions h. The main ingredients are as follows: 

• Equivalence of lower semicontinuity plus coercivity to the existence of a certain com¬ 
pact continuation h of h. 
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• An analysis of compact continuations, giving a criteria for ensuring that a concatenate 
function h = g o f allows a compact continuation h if g and / have a compact 
continuation / and g. 

Having a function h : M” —>■ [—oo, +oo] we can hence perform the strategy to write this 
mapping as composition h = g o f with mappings / and g that allow certain compact 
continuations in a hrst step. In a second step we can then try to get the needed extension 
of h. 

The hrst part is organized as follows: After recalling some set theoretic topology we intro¬ 
duce the right order topology for the set [— 00 , -i-oo] and prove the mentioned equivalence. 
Then the concept of compact continuations is introduced. An application of the theory to 
an example concludes the hrst part. 

The second part also deals with coercivity. However, lower semicontinuity no longer plays a 
role in this part. After giving dehnitions and developing some lemmata we address the easy 
case of linear mappings before moving towards the main theorem of this chapter, giving 
information on which subspaces of certain sums (Fi w F 2 ) -f (Gi w G 2 ) are coercive. 

In the third part we are interested in the relation between the convex constrained opti¬ 
mization problem 

{Pi,t) argmin{$(a;) s.t. T(a;) ^ r} (1.2) 

xgR" 

and the unconstrained optimization problem 

(-P 2 ,a) argmin{<l>(a;)-f AT(x)}, A ^ 0. (1.3) 

xeR" 

The constrained problem (1.2) is interesting only for r e OP($, T) and can then be 
rewritten as the following unconstrained one: 

argmin{<l>(a;) -F qev,vi/(a;)}. (1.4) 

xsR" 


In the inverse problems and machine learning context the problems (1.2) and (1.3) are 
referred to as Ivanov regularization and Tichonov regularization of optimization problems 
of the form argmin,j,g]R„ {$ (x)}. 

Let SOL(P,) denote the set of solutions of problem (P,). While it is rather clear that 
under mild conditions on <l> and T a vector x e SOL(P 2 ^a); A > 0 is also a solution of (Pi,r) 
exactly for r = 4/(x), the opposite direction has in general no simple explicit solution. At 
least it is known that, under certain conditions, for x e SOL(Pi^t-) there exists A ^ 0 such 
that X e SOL(P 2 ^a)- This result, beeing stated in Theorem 4.2.6 and Corollary 4.2.7, can 
be shown by using that the relation 

Mfl dT(a;) = dqev^(^jvi,(a:) 
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from [12, p. 245] holds true under certain conditions. This result is presented in Lemma 
4.2.3 and proved by using an epigraphical projection or briefly inf-projection, cf. [20, p. 
18+], which allows reducing the intrinsic problem to one dimension. 

After developing some assisting theory we consider particular problems where 

<h(a;) := 0(xi) and 4/ := ||L • || with L e M™’”; 

here Xi is the orthogonal projection of x e dom $ onto a subspace Xi of M” and 0 : Xi —>■ 
Mu {+oo} is a function which fulhlls the following conditions: 


i) dom(/) is an open subset of Xi with 0 e dom^, 

ii) (f) is proper, convex and lower semicontinuous as well as strictly convex and essentially 
smooth, and 

hi) (j) has a minimizer. 

We use the dual problems to prove that in a certain interval there is a one-to-one cor¬ 
respondence between r and A in the sense that SOL(Pi,-) = SOL(P 2 ,a) exactly for the 
corresponding pairs. Furthermore, given r, the value A is determined by A := ||p||*, where 
p is any solution of the dual problem of (Pi,r)- See Theorem 4.4.6 for more details. 

The third part is organized as follows: We first deal with two ways of interpreting each of 
the minimization problems (Pi,r) and {P 2 ,x) and show that these perspectives, though re¬ 
lated, are not equivalent in general. In Section 4.2 we state a known relation between (Pi,t) 
and (P 2 ,a) for a rather general setting, see Theorem 4.2.6. In particular, we provide some 
novel proofs by making use of an epigraphical projection. We also recall Fenchel’s Duality 
relation. Finally we discuss the mentioned Theorem 4.2.6 more in detail. In particular a 
relation between one of its regularity assumptions and Slaters Constraint Qualihcation is 
given. In close connection with Section 4.2 is Section 4.4, where we restrict ourselves to 
homogeneous regularizers and to essentially smooth data terms, which are strictly convex 
on a certain subspace of M”. We prove a relation between the parameters r and A such that 
the solution sets of the corresponding constrained and unconstrained problems coincide and 
determine the A corresponding to r by duality arguments. The intermediate Section 4.3 
provides some theorems and lemmata needed in the proofs of Section 4.4, some of which 
are interesting in themselves. In the Appendix some useful theorems are collected. The 
parts there which are not own work but are taken from the literature are clearly indicated 
by giving references. 

Applications can be found in Section 4 of [5]. Ideas from this chapter were also used in 
[24], 
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2. Coercivity and lower semicontinuity from the topological point of view 


2.1 On the relation between closed and compact subsets 

In this section we recall a known theorem, describing the relation between compactness 
and closeness. 

Theorem 2.1.1. 

i) Each closed subset of a compact space is compact. 

ii) Each compact subset of a Hausdorff space is closed. 


The subsequent proof resembles the proof of Bemerkung 2 in [14, ch. 1.8 on p. 26] and 
the proof of a Lemma in [14, ch.1.8 on p. 28]. 

Proof, i) Let (X, Ox) be a compact space and A a closed subset of this space. Let A be 
covered by open sets Oi e Ox,i ^ I- Adding the open set X\A e Ox to the Oi,i e J, 
yields an open covering of (X, Ox)- Due the compactness of (X, Ox) finitely many of the 
Oi together with X\yl suffice to cover X. Due to (X\yl) n A = 0 these hnitely many Oi 
must already cover A. So {A, A fTil Ox) is compact. 

ii) Let (X, Ox) be a Hausdorff space and A some compact subset. For proving the closeness 
of A it suffices to show that each x e X\A is an interior point of X\A, i.e. that there is 
an open neighborhood U of x with U c X\A. To this end we fix x e X\A. Since (X, Ox) 
is a Hausdorff space, there are disjoint open neighborhoods Oa e If (a) and Ua e lf{x) for 
every a e A. The open cover of the compact set A by the Oa,a e A has a hnite subcover; 
i.e. there are hnitely many ai, ..., e A with IJLi nr=i open 

neighborhood of x with 

n n n n / n \ n 

flc/a, n ^ e f| C7„, r, [JOa, - U e (J (//„, nO.J = 0, 

i=l i=l j=l j = l \i=l / j = l 

i-e. nr=i ^ X\A. So X is indeed an interior point of X\A. □ 

We point out that even a compact topological space can have compact subsets which are 
not closed. An example for this behavior is obtained when equipping the interval [— 00 , + 00 ] 
with the right order topology, see Example 2.5.7. 


2.2 Remarks on the topology induced by a metric space 

In this subsection we hrst recall some well known facts for the topology induced by a metric. 
Then we recall the equivalence of metric continuity concepts and topological continuity 
concepts. 
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2.3 Creating topological spaces from given ones 


Definition 2.2.1. Let {X,d) be a metric space. The topology generated by the ”open“ balls 
r > Of X E X, i.e. the topology 

0[d] ■■= {O ^ X : O is union of ”open“ balls }, 

will be called topology induced by d. If it is clear from the context we will also use the 
short form O for 0\d\ 

Remark 2.2.2. The open balls r > 0, x e X are really open sets from 0\d\. 

Remark 2.2.3. Let {X,d), {X',d') be metric spaces and {X,0), {X',0') the induced 
topological spaces. For a mapping f : X X' the metric continuity notions and the 
topological continuity notions are the same; speaking in particular about the continuity in 
a single point xq we have the equivalence of the following statements 

i) / : {X,d) {X',d') is continuous in Xq in the metric sense, i.e. 

Ve > 0 35 > 0 \/xeX : 

d{x,xo)<6 d'{f{x),f{xo))<e 

ii) / : {X,0) {X',0') is continuous in Xq in the topological sense, i.e., 

for every open neighborhood O' e O' of f{xo) there is an open neighborhood O e O 
of Xq with f[0] c O' (which is to say O c f^[0']). 

Similarly, speaking about continuity of the whole function, we have the equivalence of the 
statements 

i) / : {X,d) {X',d') is continuous in the metric sense, i.e. 

\/xoeX Ve > 0 35 > 0 \/xeX : 

d{x,xo)<S d'{f{x),f{xo))<e 

ii) / : (X,0) —» (X',0') is continuous in the topological sense, i.e. 

VO' e O' : f-[0'] e O. 


2.3 Creating topological spaces from given ones 

In this section we give a short introduction in four known ways of generating topological 
spaces from given ones: 

• In Subsection 2.3.1 we discuss how a subset of a topological space can be made to a 
subspace by giving it the ”correct“ topology. 

• In Subsection 2.3.2 we show how to equip finite products of topological spaces with 
a meaningful topology. 
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• In Subsection 2.3.3 we deal with the vivid notion of glueing a given object and how 
we can formalize it in the language of topology. 

• In Subsection 2.3.4 we extend every topological space to a compact one by adding 
one single new point. 

In each of this four subsections we give motivations for the definition. We remark that our 
motivation for the identihcation topology seems to be new. 

2.3.1 Subspaces 

Let (X, d) be a metric space and (X, d) = (X, d\y^y) some metric subspace. After choosing 
a point X e X c X and some ”radius“ r > 0 we can think of an open ball of radius r around 
X in two ways - on on the one hand with respect to (X, d) and the other hand with respect 
to (X, d). Though they are different in general, they are linked via 


]Br(^)[d] ={x e X : d{x, x) < r) 

=X n {x e X : d{x, x) < r) 

=X n ]B,.('r)[d]. 

For any T* e X and r* > 0, f e / we therefore have 

y B,.,(Ti)[d] = X n y B^,(Tj)[d]. 

iel iel 

So 0[{X, d)] = X fRl 0[{X, d)]. This gives rise to the following dehnition. 

Definition 2.3.1. Let (X,0) be a topologieal spaee and X c X. ITe eall (X,0) a sub¬ 
space of (X, O), iff O = X (h\ O. The topology X (hi O is ealled subspace topology for 
X c X. To the eontrary a topologieal space (X,0) is called a superspace of a space 
(X, O), iff the latter is a subspace of the first. 

The following remark illuminates that the above topology is the appropriate topology for 
subsets of a already given topological space. It states that the continuity of a function 
/ : (X, O) (Y, V) does not get lost by restricting its domain and by extending its 
codomain; 

Remark 2.3.2. Let (X, O) be a topological space with some subspace (X, O) = (X, X(hO) 
and let {Y,V) be a topological space with some superspace {Y,V). Then the following holds 
true for all mappings f : X ^ Y: 

i) / : (X, O) ^ {Y,V) is continuous f\y : {X,0) —* {Y,V) is continuous. 

ii) / : (X, O) —* (Y, V) is continuous / : (X, O) (Y, V) is continuous. 
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2.3.2 Product spaces 

Let (Yi, C>i),..., (Yn, On) be topological spaces. We search a topology O for the Cartesian 
product Y ■■= Yi, X ■ ■ ■ X Yn such that for any sequence in Y the equivalence 

(Vf e {1,..., n} : yf ^ y*) y* 

holds true. To this end we express the left hand side as explicit statement 

^iE{C...,n] yf^sU, (2.1) 

and compare it with the explicit formulation 

Vf/ e U{y*) 3k eN 'ik^k: y^^^ e U (2.2) 

for the right-hand side. On the one hand, to guarantee “(2.2) ^ (2.1)”, we should demand 
that every product U ■= Ui x ■ ■ ■ x Un, where Ui e Ui{y*), is already a neighborhood of 
y*. On the other hand, to guarantee “(2.2) (2.1)”, all those subsets Y Y, which do 

not contain any product Ui x ■ ■ ■ x Un with Ui e Ui{y*), should be barred from beeing a 
neighborhood of y*; i.e we should demand that every U e U{y*) contains some product 
Ui X ■ ■ ■ X Un oi neighborhoods Ui e Ui{y*). Altogether it seems reasonable to demand 

U E U{y^) : « 3f/i e ...,UnS Un{yl) : U ^ U, x ■ ■ ■ x Un 

This leads to the following 

Definition 2.3.3. Let (Yi, Oi), (Y 2 , O 2 ), ■ ■ ■, (Y^, On) be hnitely many topologieal spaees. 
A topology O on the Cartesian product Yi x Y 2 ■ ■ ■ x Yn —■ Y is said to be the product 
topology of Oi, O 2 ,..., On, if one of the following eguivalent conditions is fulfilled: 

i) The neighborhood system U{y*) of a point y* e Y exactly consists of the sets U = 
Ui X ■ ■ ■ X Un, where Ui e Ui{y*), i e {1, ..., n}, and of all subsets of Y which are 
supersets of these sets U. 

ii) The topology O consists exactly of those subsets O Y, which are of the form 
Oi X ■ ■ ■ X On with any Oi e Oi, i e {1, ..., n}, or can be written as union of sets of 
this form. 


The product space [Y, O) will be denoted by 


or by 


(Yi XY 2 X ■■■ xYn,Oi0O2 0---0On) 


(Y'i,0i)®(Y'2,02)®---®(Y;,a). 


As a shorter notation for (T, (T) ® • (gi (Y", O) we will also write (Y^, O^). 

^ -V-^ 

n times 
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In most cases we deal with y = M equipped with its natural topology O = 0\d\, where d 
is the natural metric dehned by d{x, y) = \x — y\. The product topology for M” equals 
its natural topology, i.e. the topology generated by every norm on M”. 

Remark 2.3.4. Let (Ti,C> 2 ,C >3 be some topologies. Then we have Oi <S) O 2 <S) O 3 = 
{Oi ®C> 2 ) = Oi® (O 2 ®Os), i.e. building product spaces is an associative operation. 

The following remark illuminates that the above dehned topology is the appropriate topol¬ 
ogy for the Cartesian product of already given topological spaces. It states that a “multi¬ 
valued” function is continuous iff its component functions are continuous. 

Remark 2.3.5. A mapping f : {X,0) {Yi,Oi) ® (¥ 2 , 02 ) 0 ■■■ ® (Yn,On),x ^ 
f{x) = {fi{x),f 2 {x),...,fn{x)) is continuous if and only if all its component functions 
fi : {X, O) —» (Yi, Oi), is {!,... n}, are continuous. 

Next we state Tichonov’s Theorem for the simple case of building the product of only 
hnitely many compact spaces. For a proof see [17, Theorem 5.7 on p. 167]. 

Theorem 2.3.6 (Tichonov’s Theorem for hnite products). The product space of finitely 
many compact spaces is compact. 

Remark 2.3.7. We only introduced the product space of finitely many topological spaces. 
Although it is possible to declare a product space also for infinitely many topological spaces, 
we have decided to avoid this, more complicated and harder to grasp, construction, since 
we will not need it. 

We conclude this subsection with a remark showing that the order in which the actions of 
building subspaces and product spaces are done have no inhuence on the hnally resulting 
topological space: 

Remark 2.3.8. Given two topological spaces {Xi, Oi) and {X 2 , O 2 ), the Cartesian product 
Xi X X 2 of two subsets Xi c Xi and X 2 ^ X 2 has to be eguipped with a topology. Two 
natural ways of eguipping Xi x X 2 with a topology seem possible: On the one hand Xi x X 2 
can be interpreted as subset of Xi x X 2 and thus be eguipped with the subspace topology 

(W X X2)(h\ {61062). 

On the other hand Xi x X 2 can be seen as Cartesian product of the sets Xi and X 2 and 
thus be eguipped with the product topology 

(Wi (Hi 61 ) ( 8 ) (X 2 fnl O 2 ). 

Luckily these topologies are actually identical since the sets 

(Oi X O 2 ) n (Wi X X 2 ) = (Oi n Xi) x {62 n X 2 ), 
where Oi s Oi, O 2 ^ O 2 , form a base for both topologies. 
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2.3.3 Identification or quotient spaces 

In the following example let O be the natural topology of M and § ;= {x e : ||a ;||2 = 1}. 
Example 2.3.9. Consider the surjective and continuous mapping f : ([0, 27r], [0, 27r] fRl 

O) (§,§n given by 


a; I—> e 


ix 


( cosx 
ysinx 


The impression occurs that the straight line ([0, 27r], [0, 27r] fTilO) is transformed to the circle 
line (§, § fRl by gluing the endpoints 0 and 27r to one and the same point (1,0)"^ = 

/(O) = f{2n) of the circle line. At any other point x e (0,27r), where nothing is glued, 
it seems that nothing essential changes: A small interval-like-neighborhood U' of f{x) 
seems to be just the image f\U^ of some small interval-neighborhood U of x. In contrast 
it seems that a small interval-like-neighborhood U' of (1,0)"^ = /(O) = /(27r) is obtained 
from gluing a small neighborhood, say [0,£:i), o/O e [0, 27r], with a small neighborhood, say 
(27r — 62 , 2ti\, of 27r e [0, 27r]. So whatever point x's S> we consider: It always seems that a 
neighborhood U' of x' is build by taking a suitable Ux e Tl{x), for every x with f{x) = x', 
and then getting U' as union of the images of the Ux, i.e. via 

U'= U f[Ux\, (2.3) 

a:: G [0,2 TT ]: / (tr) — o:: ^ 


or, to put it more vividly, by glueing neighborhoods Ux, x e f [U']. 


The next remark serves as a bridge between the previous example and the subsequent 
dehnition of an identifying mapping. It picks up (2.3) and shows how this naturally lead 
to the dehnition of identihcation topology and identifying mapping. This way of motivating 
the identihcation topology seems to be new. 


Remark 2.3.10 (Motivation for the dehnition of the identihcation topology). Consider a 
surjective mapping f : (X,0) X' between a topological space (X,0) and some set X'. 
Assume that there is a topology O' on X' such that every neighborhood U' of an arbitrarily 
chosen point x' results from gluing neighborhoods of all preimage points x e f^[{x'}]; i.e. 
assume that there is a topology O' on X' whose neighborhood systems fulfill 


U'(x') = lU' ^X' 


yxeriix'}] 3 Uxell{x) 


u'- u nv] 

xef-[{x'}] 


(2.4) 
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2 . Coercivity and lower semicontinuity from the topological point of view 


for every x' e X'. Is it then possible to describe O' in a more direct manner? Due to the 
following equivalences for a subset O' c X' we can give a positive answer to this question: 


O' e O' 

Vx' e O' : O' e U'{x') 

Vx' e O' Vx e /'[{x'}] 30, e U{x) : O'= (J /[O,] 

Vx' e O' Vx e r[{x'}] 3U, e U{x) : O' 3 |J f[U,] 

^ef-[{x'}] 

Vx'eO' Vxe/'[{x'}] 30^eW(x)nO:0'3/[ |J O^] 

^sf-[{x'}] 


Vx' e O' 30 e O : /^[{x'}] c O a O' 3 f[0] 
30 e O Vx' e O' : /^[{x'}] c O a O' 3 /[O] 
30 e O ; /-[O'] c O a /-[O'] 3 O 
30e O : /-[O'] = O 
/-[0']eO. 


Oote the harder implication in (*) holds true, since O, := / [O'] ^ Ux is a 
neighborhood for each x e /-[{x'}] and fulfills /[O,] = O', in virtue of f’s surjectivity. 

Summarizing we can say that necessarily 

O' = {O'c X' : /-[O'] e O}. 


This motivates the following definition. Take note, though, that we did not prove that the 
topology |0' c X' : /-[O'] e O} actually induces neighborhood systems which fulfill (2.4). 

Definition 2.3.11. We say that a mapping f : (X, O) {X', O') between two topological 
spaces {X,0) and (X',0') is identifying or that it glues (X,0) to (X',0'), iff it is 
surjective and 


O' = {O' c X' : /-[O'] e O}. 

The topology O' is called quotient topology or identification topology induced by 
/ and O and (X', O') is called the quotient space or identification space induced 
by f and O. 

The identification topology is uniquely determined by the surjective mapping /, cf. Remark 
2.3.12. 

Remark 2.3.12. If a topological space {X,0) is glued to a topological space {X',0') by a 
mapping g then the, by definition surjective, mapping g is in particular continuous; to see 
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this just compare 

<^X' -.S' E Ox' g^[S'] E Ox) 

{^S' <^X' -.S' E Ox' « e Ox) 

A g is surjective. 

More precisely one can read from the above lines, that a surjective mapping g glues a 
topological space (X,0) to a topological space {X',0'), iff O' is the finest topology on X' 
for which g : (X, O) (X', O') is still continuous. 


with 


g is continuous 
g is identifying 


The relation between ”honieoniorphic“, ’’identifying“ and ”continuous“ is shown in the fol¬ 
lowing diagram. 


/ : (X, Ox) (X', Ox') is a homeomorphism 

/bij. 

\' 

f : (X, Ox) - (X', Oxfi glues (X, Ox) to (X', 

/ surj. and (open or closed) 

/ : (X, Ox) (X', Ox') is continuous 


The relations between the hrst and second row are easy to see, by Remark 2.3.12. The 
implication from the second to the third row is also clear by this Remark. It remains 
to deal with the implication from the third to the second row. Before illustrating this 
condition and then moving towards its justification in Theorem 2.3.14 we would like to 
warn the reader that restricting identifying mappings is more problematic than restricting 
continuous mappings or homeomorphisms; The restriction of a continuous mappings resp. 
homeomorphism are again continuous mappings resp. homeomorphisms. In contrast the 
restriction of an identifying mapping is not necessarily again identifying, cf. Example 
2.3.17. Now we return to our discussion of the implication from the third row to the 
second row. As stated in Remark 2.3.12, every identifying mapping {X,0) —> {X',0') is 
continuous. However, the opposite is not true. The identity mapping id{o,i} : {0,1} ^ {0,1} 
between (X, C>) = ({0,1}, {X, 0, {0}}) and {X',0') = {X,0') = ({0,1}, {X, 0}) is a 
simple, but maybe not very natural, example. A more natural example for a surjective 
continuous mapping, which is not identifying is given in Example 2.3.15. 

The proof of the following lemma, can also be found in [27, p. 109]. 

Lemma 2.3.13. A continuous mapping g : (Y,Oy) {Z,Oz) from a compact space 
{Y, Or) into a Hausdorff space [Z, Oz) is always a closed mapping. In particular g is a 
homeomorphism if g is additionally bijective. 
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Proof. A closed subset of the compact space {Y, Oy) is again compact by part i) of Theorem 
2.1.1; therefore it is mapped by the continuous mapping g to a compact subset of {Z, Oz), 
which is a closed subset of this Hausdorff space, by part ii) of Theorem 2.1.1. Hence g is a 
closed mapping. If g is in addition bijective then the mapping g is also an open mapping, 
since the image g\0\ of every open subset O e Oy can then be written in the form 
g\0] = g'[F\(y\0)] = 5 r[F]\^[y\ 0 ] = Z\g\Y\0], showing that g\0] is the complement of 
the closed set 5r[y\0] and hence an open subset of {Z,Oz). Therefore the mapping g is 
open and continuous and hence a homeomorphism. □ 

Each identifying mapping is also continuous. The converse is not true in general. Yet the 
next theorem gives some sufficient criteria for ensuring that a continuous function is even 
identifying. 

Theorem 2.3.14. A surjective continuous mapping g : (Y,Oy) —» {Z,Oz) is identifying, 
if at least one of the following additional properties is fulfilled: 

i) g is a closed or open mapping. 

ii) (Y, Oy) is a compact space and (Z, Oz) is a Hausdorff space. 

Before proving this theorem, we give an example for a continuous, but not identifying 
mapping g, which is dehned on a simple subset Y of and maps onto a compact interval 
Z. By Theorem 2.3.14 it is clear that Y must not be a compact subset of and 

that g must not be open and closed. We note that our example was inspired by an example, 
given by Kelly in [15, ch. Quotient spaces, p. 95], illustrating that there are continuous 
mappings which are neither open nor closed. The natural topology of M is denoted by O. 

Example 2.3.15. The interval [—1,1] can he generated by putting a single point, say 
(0,1) e into the gap of [—1,1]\{0} = [—1, 0) u (0,1]. This operation is modeled by the 
mapping g : (Y, Oy) (Z, Oz), g{y) ■= yi, where 

Y--= ([-l,l]\{ 0 }x{ 0 })u{( 0 ,l)} 

and 

Z := [-1, 1] 

are endowed with the subspace topologies Oy = Y fRl and Oz = Z <P\0, respectively. 
The projection g is continuous but, however, not identifying: Consider the point 0 e [— 1 , 1 ] 
and its only preimage point (0,1) e Y. The isolated point (0,1) e Y has {(0,1)} =: U as 
smallest open neighborhood. Yet g\U) = {0} is no neighborhood of 0. ITe remark that 
the same reasoning shows that g is not an open mapping; moreover g is neither a closed 
mapping since it maps the closed subset [—1,0) x {0} of {Y,Oy) to [—1,0) which is not a 
closed subset of (Z, Oz). 
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Proof of Theorem 2.3.14- i) Since g : (Y, Oy) {Z, Oz) is surjective we have 

g is continuous (yZ Z : Z e Oz 9 [Z] ^ Oy^ (2-5) 

and 

g is identifying (^Z ^ Z ■. Z e Oz 9 [Z] ^ Oy^ . (2-6) 

So our task of proving ”g is continuous g is identifying“ reduces to verify the statement 

VZ c Z ; g-[Z] e Oy Z e Oz- (2.7) 

In the hrst case that g is open, i.e. fulhlls 5'[Oy] e Oz for all Oy e Oy we are done by 

writing Z = and setting Oy ■= g^[Z]. In the second case that g is closed, i.e. 

fulhlls 5 '[Ay] e Az for all Ay e Ay - where Ay and Az are the systems of the closed 
subsets of {Y, Oy) and {Z, Oz), respectively - we translate all involved statements of the 
previous reasoning from their ’’open set viewpoint“ formulation (2.5), (2.6) and (2.7) to 
the corresponding ’’closed set viewpoint“ formulation, by means of building complements. 
Then the reasoning goes the same way as before. 

ii) By Lemma 2.3.13 the function g maps every closed subset of (Y, Oy) to a closed subset 
of {Z, Oz) and therefore fulhlls i), which implies that g is identifying. □ 

In the next theorem we consider two functions g : (Y, Oy) (Z, Oz) and g' : (Y', Oy') —>■ 
(Z, Oz) which are identical except that their domains of dehnition do not need to be totally 
identical; rather (Y, Oy) shall only to be glued to (Y', Oy/) by an identifying mapping 
I : (Y, Oy) -> (Y', Oy'). The theorem states that g is continuous respectively identifying, 
ih so is g'. 


(y, Oy) 



(r, Oy') 

Theorem 2.3.16. Let g : {Y,Oy) —> {Z,Oz) and g' : (Y',Oy') {Z,Oz) be mappings 
between topologieal spaees, whieh are related via g = g' o I, with a mapping I that glues 
{Y,Oy) to (Y',Oy'). Then the following statements hold true: 

i) g is eontinuous g' is eontinuous. 

ii) g glues {Y,Oy) to {Z,Oz) g' glues {Y',Oyi) to {Z,Oz)- 

See also [15, p. 95 - 96] for the hrst part of the subsequent proof. 
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Proof. Since I is identifying we have, for every subset Z of Z, the equivalences 


r[Z] e Oy, 

Having this in mind we get 
g is continuous 


and 


g is identifying 


r[g'-[Z]] E o 


Y 


VZ Q Z : Z E Oz 
VZ C z : Z E Oz 

g' is continuous 

\JZ Z -.Z eOz 
\JZ ^ Z -.Z eOz 

g' is identifying. 


g-[Z] E O 


¥■ 


g-[Z] E Oy 
g'-{Z] E Oy' 


g [Z] e Oy 
g'-[Z] E Oy^ 


□ 

We end this subsection with a warning: in general a restriction of an identifying mapping 
is no longer identifying as the following example shows. Again O is the natural topology 
of M and S ;= {x e : ||x ||2 = f}. 

Example 2.3.17. We consider, once more, the both surjective and continuous mapping 
f : ([0, 27r], [0, 27r] fFil O) —>■(§,§ n given by 

f cos x\ 

X ^ e = \ . 

ysinx J 

This mapping is identifying by part ii) of Theorem 2.3.14- Restricting this mapping to the 
subset X ■■= [0, 27r) we get the continuous bijection f\x ■ ([0,27r), [0, 27r) fRl C>) —» (§, S n 
(9®^) which is no longer identifying, since an identifying bijection would necessarily be an 
homeomorphism, cf. the Diagram on page 23. However the spaces ([0,27r), [0, 27r) fRl O) 
and (§, § n are clearly not homeomorphic, since only the latter one is compact. 

2.3.4 One-point compactification of a topological space 

We start with a well known special case before we give the general dehnition. 

Example 2.3.18 (and Definition). R is often convenient to regard M" as the subset 
§(”)\{(0,0,...,0,1)} =: 5 of the sphere ,£>§(»)) == fRl by means of 

the homeomorphism 

TT . (Si, S2, • • • , Sn, Sn+l) ' * 1—('^1’ j ^n) i 
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known as stereographic projection, cf. [17, p. 350], The topological superspace C>§(n)) 
of (S', S ifTl Os(™)) differs not much from the latter: The set 




contains just one point more than S and the topology 0§(n) 2 S'ffTlOgM differs from S'ffTlOgM 
only by additionally containing the open neighborhoods of the ’’north pole“ (0,0 ,..., 0,1) =: 
N as expressed by 


(9s(„) = (^ ffTl Os(„)) 0 {O G 0§(„) :NeO} 

= (^ ffTl C>s(„)) 0 : A e ^(§(”)), A c 

= (^ ffTl C>s(„)) 0 : K e /C(§(”)), K c 

= (^ ffTl C>s(„)) 0 : K e 1C{S)]. 


Likewise we set := M” u {oo} with an additional point oo ^ M" and define 

Qm (O^)oo ■■= 0 {K\K : K e /C(R^)}. 

Then (R^,Or) is a compact topological space, called the one-point compactification 
o/(R”,0®”); it contains (R'^,(9®”) as dense subspace. Moreover the homeomorphism tt : 
(5, 5(fil 0§(n)) —>■ (R”, O^) can be extended to a homeomorphism Ogw) —>■ (R(^, 0®”) 
by setting 7r{N) := oo. Setting ||oo|| := -i-oo we then have for any seguence of points from 
(R”,C>^) the relation 


Xk 


00 


TT ^(Xfc) 

TT~^{Xk) 


TT ^(oo) 

N 


Xk\ —>■ + 00 . 


For general topological spaces {X, O) the procedure is done similarly by adding a new point 
00 , resulting in the set Xoo ■= X u {oo}, and by equipping oo with an appropriate system 
of neighborhoods. In the latter we have to be careful if (X, O) is not a Hausdorff space. 
Namely, in this case it may happen that there are compact subsets Ki,K 2 e X{X,0) 
whose intersection Ki n K 2 is no longer compact, see Detail 1 in the Appendix; we would 
therefore fail here, when we were trying to dehne the open neighborhoods of the new point 
00 as the sets 


X^\K, with K E X{X, O), (2.8) 

since the union of the ’’open neighborhoods“ Xco\Xi and Xoo\K 2 is the set (Xoo\A'i) u 
{Xoo\K 2 ) = Xoo\{Ki n K 2 ) which is no longer a ”neighborhood“ of 00 . This problem is 
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solved if we restrict us in (2.8) to those compact subsets K of {K, O) which are additionally 
closed, see Detail 2 in the Appendix. Choosing 

X^\K with K e ICA{X, O) (2.9) 

as the open neighborhoods of oo indeed is the right idea. Before we give the dehnition 
of the general one-point compactihcation in accordance to (2.9) we note that the sets 
X^\K in (2.8) and (2.9) coincide if {X, O) is a Hausdorff space since in this case we have 
/C(X, O) c A{X, O) by part ii) of Theorem 2.1.1. The following general dehnition as well 
as the subsequent Theorem 2.3.20 are, in essence, taken from [15, p. 150]. 

Definition 2.3.19. Let (X,0) be a topological space and co ^ X an additional point. 
The one-point compactification of (X,0) is the space (X,0)or, ■■= (Xa^.Oac,), where 
Aoo := A u {oo} and 0^-.= 0yj {X^\K : K e XA[X, O)}. 

Theorem 2.3.20. The one-point compactification (Xoo, C^oo) of a topological space {X, O) 
is a compact topological space, which contains (X, O) as subspace. (Xqo, Ooo) is a Hausdorjf 
space if and only if X is a locally compact Hausdorff space. 

2.4 Topologization of totally ordered sets and 
topological coercivity notions 

In this section’s subsections 

• 2.4.1 Three topologies for totally ordered sets 

• 2.4.2 The right order topology on an inf-complete totally ordered set 

• 2.4.3 Topological coercivity notions and continuity interpretations 

• 2.4.4 Topological coercivity and boundedness below 

we introduce for a given totally ordered set [Z, <) the right order topology (along with two 
other topologies), give its very simple form in case of totally ordered sets, use it to dehne 
topological coercivity notions and show its good inhuence when investigating boundedness 
from below. 

More precisely we introduce in the hrst subsection three different topolgies for a given 
totally ordered set [Z, <). For us the most important of them is the right order topology 
7^, beeing the suited topology to investigate lower semicontinuity. Also with regard to 
coercivity questions this topology is useful. 

In the second subsection we will see that [Z, 7^) becomes very simple if the underly¬ 
ing totally ordered set is inf-complete. The topology T = 7^ of the topological space 
([— 00 , -l-oo], T) is an important example and will be studied in more detail in Section 2.5. 
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In the third subsection the notions of topological (strong) coercivity towards a set and some 
boundedness notions are introduced. In Theorem 2.4.20 we will see that a mapping / : 
(X, O) —> (X', O') is topological coercive (towards 0) iff a certain extension / : (X, 0)oo 
(X', O')oo' is continuous in the newly added point oo. In case of a mapping / : R” —>■ R™ 
this later turns out to be equivalent to the normcoercivity of /, see Theorem 2.5.18. For 
a mapping f : {X,0) another similar relation can be described if the totally 

ordered set (Z, has a maximum £ and a minimum. In this case / : (X, O) (Z, 7^) 

is topological coercive towards {£} iff another certain extension / : (X, (T)oo ^ {Z,T<:) 

is continuous in the newly added point oo, see Theorem 2.4.21. In case of a mapping 
/ : R” —>■ [— 00 , +oo] this will turn out to be equivalent to the coercivity of /, see Theorem 
2.5.16. 

In the fourth and last subsection we recall the usual global boundedness dehnition for func¬ 
tions / : (X, O) (Z, <) and add two less common, more easier to check, local bounded¬ 
ness notions and show that the local ones imply the global one if / : (X, O) (Z, 7s>) is 

topological strongly coercive towards MAX 5 ;(Z). Note that here Z is not equipped with 
the right order topology but really with the left order topology! 

Finally we mention that the right order topology is a special case of the Scott topology for 
a partially ordered set (Z, c). The latter topology is dehned as the collection of all subsets 
O of Z which fulhll the following conditions: 

i) Along with any z e O also the “upper set” {z e Z : z ^ z} belongs to O; 

i.e. - more formally expressed - the condition M z e O \lz e Z : z ^ z z e O 

holds true, 

ii) Every directed subset S of (Z, c) whose supremum exists and belongs to O has 
nonempty intersection with O, i.e. fulhlls S' n O A 0, 

cf. [21] where Scott dehned this topology using the name “induced topology”. 

2.4.1 Three topologies for totally ordered sets 

Before dehning topologies out of < we remark that we use interval notation just as for R 
endowed with the natural order. In addition we introduce analogues for the unbounded 
real intervals like (— oo,5]. 

Definition 2.4.1. Let (X, <) be a totally ordered set. We use the shorteuts 

h) ■.= [x E X : X < 5}, 
b] ■= {x E X : X ^ b}, 

{a ■■= {x E X : a < x}, 

[a := {x e X : a < x). 
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If the totally ordered set is denoted with a decoration like in we feel free to adopt the 
notation accordingly and write e.g. (fa instead of (a. 


Given a totally ordered set (X, <) we consider three different topologies for it, namely 
two “one sided” topologies and one “two sided” topology. We start with the “one sided” 
topologies, cf. [22, p. 74], But be aware that the dehnition there is not totally correct, see 
Detail 3 in the Appendix. A correct dehnition can be found in [32]. 

Definition 2.4.2. Let (X, <) be a totally ordered set. The system of sets, which are 0,X 
or which can be written as unions of sets of the form (a, with a e X, forms a topology. It 
will be called right order topology for (X, <) and will be denoted by 7^. Analogously the 
left order topology for (X, <) is defined as system of sets which are 0,X or which 
can be written as unions of sets of the form b), with b e X. 

Remark 2.4.3. i) The notations for the right order topology and the left order topology 
for a totally ordered set (X, <) are consistent: Define the inverse order ^ on X via 
X < ?/ : X y for all x,y e X. Then the left order topology 7^ for (X, <) is 

indeed just the right order topology 7< for (X, <). 

ii) The above systems 7^ and 7j> are really topologies on X: By the first part of this 
remark it suffices to prove that T<^ is a topology. 0 and X belong to 7^ by definition. 
Clearly arbitrary unions of sets from 7^ belong again to 7^ by definition of this 
system. Finally also the intersection of two sets T, S e %: again belongs to that 
system: IfT or S is empty we have T n S' = 0 e 7^. Likewise T n S' belongs to 7^ 
if T = X or S = X. In the remaining case T = and S = [Jjgj(sj with any 

index sets I, J and elements ti, sj e X we finally have 


Tr^S = 

U(t. 

n 


-u 

(ti n y (sj 


_iGl 


JeJ 

iGl 

jeJ 


-UU 

iel jej 


(U n 



y (max{ti,sj}. 

ieljej 


Hence we have shown T n S' e 7^ also in this case. 


Now the “two-sided” topology is introduced, cf. [32] and [27, p. 22]. 

Definition 2.4.4. Let (X, <) be a totally ordered set. The order topology for (X, <) is 
the system consisting of 0,X and the “open intervals” 

(a, b) or (a or b) 

where a,b e X, and all unions of the open intervals. 

Example 2.4.5. The order topology for (M, <) is the natural topology o/M which is induced 

^7 I • I ■ 
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Remark 2.4.6. The order topology for a totally ordered set {X, really is a topology: In 
order to avoid dealing with many cases we first represent the sets from the system 0<^ in a 
unified way, which has been mentioned in [32], To this end let «— and —> be two elements 
which are not yet contained in X. Then set 

X-.- 

and extend the total order ^ on X to a total order on X (again denoted by by addi¬ 
tionally setting <—< x and x <—>■ for all x e X. Then 

X = (-,-) (a=(a,-) 

0 = (-,-) b) = {^,b) 

for all a,b e X so that the sets from 0<z appear now simply as the unions of sets of the form 
(a, b) where a,b e X. This representation makes it clear that arbitrary unions of sets from 
(9<c belong again to O^. Moreover the intersection of two arbitrary sets O = 
and P = [Jjej{cj,dj) - with ai,bi,Cj,dj e X and any index sets I,J - can be written in 
the form 

O n P = U ^ = [J{rnax{ai,Cj},mm{bi,dj}) 

iel iel 

jeJ jeJ 

SO that the intersection O n P again belongs to O^. Finally clearly X, 0 e 0<^ so that 0<^ 
really is a topology on X. 

Proposition 2.4.7. Let a totally ordered space {Z, <) be eguipped with its right order 
topology 7^. If {Z, <) has some minimum 'z then the only {Z, P0j-neighborhood ofz is the 
whole space Z. In particular a mapping f : {X, O) {Z, T0 is continuous in all points x 
which are mapped to the minimal element. More formally expressed: U\T0{z) = {Z} and 
\/x E X : (/(x) = ^ / is continuous in x). 

Proof. Clearly the whole space Z is a neighborhood of z. It is also the only neighborhood 
of 2^ since this minimum is never contained in a set (a, a e Z, and hence also not in unions 
of such sets. Let a; e X be a point with f{x) = z. For each neighborhood P of a: we 
trivially have f[U] c Z. Since Z is the only existing neighborhood of F = f{x), this 
inclusion already shows that / is continuous in x. □ 

Recall in the next theorem that a mapping / : {X, <) —^ (X', <') between ordered sets is 
called an order isomorphism iff / is bijective and fulhlls f{xi) f{x2) Xi < X2 

for all Xi,X2 e X. 

Theorem 2.4.8. Let (X,<) and (X', <') be totally ordered sets with their corresponding 
topological spaces (X,7^) and (X',7^/), respectively. For a mapping f : X ^ X' the 
following holds true: 
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i) /// : {X, %:) (X', T^i) is continuous in x* then f{x) /(x^) for all x 

ii) If f : (X,7^) —> (X',7^/) is continuous then f : (X, —> {X\ <') is monotonically 

increasing. 

iii) /:(x,r^)-(x',r^O is a homeomorphism, iff f : (X,<) —> is an order 

isomorphism. 


Proof. 

i) Let / : (X,7^) {X\T<^i) be continuous in x^ e X. For x = x^ we trivially have 

f{x) iff' f{x^). Assume that there is an x > x^ such that /(x) f{x^). This means 

/(x) <' /(x*), because is a total order on X'. Hence /(x^^) e {'fix) =■ U'. Since / is 
continuous in x^ there is an open neighborhood U e U{x^) with /[t/] = {'f{x). Since 

X > X* assures x e 17 we would consequently get /(x) e /[17] c ('/(x) - a contradiction. 

ii) This directly follows from i) 

iii) Let / : (X,7^) —» (X',7^') be a homeomorphism. The continuity of / : (A, 7^) ^ 

(X',7^/) and f~^ : (X',7^/) ^ (A, 7^) yields the monotonicity of / : (A,<) —>■ (A',<') 
and : (A , <') —>■ (A,<), respectively, by part ii). Now let, to the contrary, / : 
(A,^) (A , <') be an order isomorphism. Then the bijective mapping / gives a one 
to one correspondence between the open sets of (A, 7^) and the open sets of (A ,7^') - 
essentially by (a <->■ {'f{a). Thus / is a homeomorphism between these two topological 
spaces. □ 


The following example shows that there are monotone functions between totally ordered 
sets which are not continuous in the deduced topologies. 


Example 2.4.9. Consider the totally ordered sets (A,<) = 
({2, 3}, <'), with the natural orders < on [0, f] and on {2, 3}. 
(A , <'), given by 


fix) 


2 i/xe[ 0 , 1 ) 

3 if X = 1. 


([ 0 , 1 ], <) and (A ',<0 = 
The mapping f : (A, <) —>■ 


is monotone; yet f : (A, 7 ^) —> (A', 7 ^/) is not continuous: The preimage of {3} = jx' e 
X' : x' >' 2} = ('2 e 7^' is the set {1} c [0,1], This nonempty set does not belong to 
T<^, because it is neither the full space X, nor can it be written as union of intervals of the 
form (x where x e [0,1]. 


2.4.2 The right order topology on an inf-complete totally ordered 
set 

In this subsection we give a remark showing that the right order topology gets very simple 
if the underlying totally ordered set (A, <) fulhlls a property called inf-completeness which 
is dehned as follows: 
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Definition 2.4.10. We call a totally ordered set (X, <) inf-complete, ijf each subset 
X c X possesses an infimum inf X e X. 

Remark 2.4.11. The the right order topology becomes very simple if it is given to a totally 
ordered set {X, which is inf-complete: Consider the union of sets [oi with ai e X where 
i runs through some nonempty index set I. Due to the inf-completeness of {X, <) we know 
that inf{ai : i e 1} —■ a exists in X so that the union 

U= (a 

iGl 

is again of the very same form as the original sets. In particular 7^ just consists of X, 0 
and the sets of the form (a where a e X. 

Example 2.4.12. Consider the set X ■= (0,1) u (2,4) endowed with the usual order 
The totally ordered set {X, <) is not inf-complete since the interval (2, 3) c X has many 
lower bounds in X but no infimum in X. Setting Oi ■= 2 -\- I, i e N we see that the union 

\J(a. = {2.i) 

ieN 

is neither 0, X nor of the form {a with some a e X. 

2.4.3 Topological coercivity notions and continuity interpretations 

Recall that )C{X, O) denotes the system of compact subsets of a topological space (X, O), 
whereas the system of its compact and closed subsets is denoted by 1CA{X,0). In the 
following we will need the following subsystems. 

Definition 2.4.13. Let (X, O) be a topological space and S' c X. Then we set 

KAs{X, O) := [K E X^(X, O):KnS = 0}, 

Ks{X, O) := {K E X(X, O)-.Kr^S = 0]. 

Note that X.4.0(X, O) = 1CA{X, O). The main idea behind the first definition is to collect 
all those closed and compact subsets of (X, 0) in the set system fCAs^X^O)., which are 
not allowed to hit the set S but which might come “arbitrary close” to S. The idea behind 
the second definition is similar. 

Lemma 2.4.14. Let (0^) be a totally ordered set which has a minimum z. Then the 
following holds true: 

i) All closed subsets of {Z,T0 are compact; in particular JCA{Z,T0 = A{Z,%:). 

ii) If {Z, <) contains also a maximum z then lCA{z){Z, 00) = {Z\U' : Lf e U'{z) n 7^}. 
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Proof, i) No open set O e 7^ contains the minimum ^ except for O = Z. Except for the 
closed set 0 = Z\Z, which is anyway compact, every closed subset A of {Z, 7^) contains 
hence z. In particular any open covering (Tj)jg 7 of such a set A must have a member T, 
which is an open neighborhood of However, the only neighborhood of this minimal 
element is the full space Z by dehnition of 7^. So picking out T = Z already gives a hnite 
subcovering for A Z. Hence the nonempty closed subsets of (Z, 7^) are compact. In 
particular ICA{Z, 7^) = /C(Z, 7^) n A(Z, 7^) = A(Z, 7^). 

ii) Using the previous part we see that the system /CA(zj(Z, 7^) consists of exactly those 
closed subsets of (Z, 7^) which do not contain 2", i.e. of exactly the complements of those 
open sets which contain £. In other words the system /CA(zj(Z,7^) consists of exactly 
the complements of open neighborhoods of This is what the formula /CA(z}(Z,7^) = 
{Z\[/' : U' e U'{z) n 7^} expresses. □ 

The hrst parts of the following two dehnitions stem from [31] where just the name “coercive” 
was used. However we prefer the names “topological coercive” and “strongly topological 
coercive” here. The second parts of these dehnitions are new to the best of the author’s 
knowledge. After stating the dehnitions we give some remarks on them and point out a 
relation to the notions of norm coercivity and coercivity. 

Definition 2.4.15. A genuine mapping f : (X,0) —> between topological spaces 

{X, O) and (X', O') is called topological coercive, iff for every closed compact subset K' 
of X' there is a closed compact subset K of X such that f[X\K] c X'\K'; i.e. - more 
formally expressed - iff 

7K' e 1CA{X', O') 3K e 1CA{X, O) : f[X\K] c X'\K'. 


holds true. 

More generally we say that f is topological coercive towards a set S' ^ X iff for every 
closed compact subset K' of X' which does not hit S' there is a closed compact subset K of 
X such that f[X\K] c X'\K'; i.e. - more formally expressed - iff 

SK' E XAsfX', O') 3K E }CA{X, O) : f[X\K] c X'\K'. 


holds true. 

By replacing “compact and closed” in the codomain in the previous dehnition by “compact” 
we get the following dehnition; 

Definition 2.4.16. A genuine mapping f : {X,0) {X',0') between topological spaces 

{X, O) and (X', O') is called topological strongly coercive, iff for every compact subset 
K' of X' there is a closed compact subset K of X such that /[X\X] c X'\K'; i.e. - more 
formally expressed - iff 

VX' e X(X', O') 3K e ]CA{X, O) : f[X\K] c X'\K' 
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holds true. 

More generally we say that f is topological strongly coercive towards a set S' ^ X 
iff for every compact subset K' of X' which does not hit S' there is a closed compact subset 
K of X such that f[X\K] c X'\K'; i.e. - more formally expressed - iff 

SK' e XsfX', O') 3K e XA^X, O) : f[X\K] c X'\K' 

holds true. 

Remark 2.4.17. A genuine mapping f : (X,0) (X',0') is topological coercive iff it is 

topological coercive towards 0. Likewise the mapping f is topological strongly coercive iff 
it is topological strongly coercive towards 0. 

Remark 2.4.18. The previous Definitions 2.4-15 and 2.4-16 coincide if the codomain 
{X', O') is a topological space whose compact sets are all closed, e-g- if {X', O') is a 
Hausdorjf space, cf- Theorem 2-1-1- In later applications however the codomain will be a 
totally ordered set eguipped with the right order topology which contains compact sets that 
are not closed, so that the definitions no longer coincide- 

Remark 2.4.19. Although the notion of topological coercivity is defined in the context of 
any topological spaces {X, O) and {X', O') it is rather made for noncompact spaces {X, O) 
and {X', O'); if one of these spaces is compact the notion of of topological coercivity becomes 
uninteresting: If {X, O) is compact then every genuine mapping f : (X, O) {X', O') 
from {X, O) to any topological space (X', O') is trivially topological coercive since we can 
always choose K ■= X. If, on the other hand, the space (X', O') is compact we can choose 
K' ■-= X' so that a genuine mapping f : (X, O) (X', O') is topological coercive iff (X, O) 
is compact. 


In Subsection 2.5.4 we will define the notion normcoercive for mappings / : W" M™ 
and the notion coercive for mappings / : —>■ [—oo, +oo] and see that these notions 

are special cases of topological coercivity towards a set: One the one hand a mapping 
/ : R” ^ R™' is normcoercive iff it is topological coercive, i.e. topological coercive towards 
0, see Theorem 2.5.18. On the other hand a mapping / : R'^ ^ [—oo, +oo] is coercive iff 
it is topological coercive towards max[—oo, +oo] = {+oo}, see Theorem 2.5.16. For proving 
these equivalences the subsequent two theorems will be helpful. 

The first of these theorems states that the topological coervivity of a mapping / : (X, O) 
(X', O') can be viewed as continuity at “infinity”; 


Theorem 2.4.20. Let (X, O) and (X', O') be topological spaces and (X, O)oo and (X', O')oo' 
their one-point compactifications. For a mapping f : X X' and its extension f : Xqo —>■ 
X'^,, given by 


fix) 


f{x), ifxsX 
oo', if X = CO 


the following are eguivalent: 
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i) / : (X, O) {X', O') is topological coercive. 

ii) / : (X, 0)co —* (X', 0')co' is continuous at oo. 

Proof. Using the definitions of topological coercivity and the definition of the one point 
compactification we get 

/ : (X, O) —» (X', O') is topological coercive 
« VX' e 1CA{X', O') 3K e }CA{X, O) : f[X\K] c X'\K' 

« VX' e XA(X', O') 3X e XA{X, O) : f[X\K] u {oo'} c {X' u {oo'})\X' 

« VX' e XA{X', O') 3Ke XA{X, O) : /[Xoo\X] c X'^,\K' 

« VX e X'(oo') 3U E X(oo) : f[U] c U' 

/ : (X, 0)ac —>■ (X', 0')ac' is continuous at the point oo. 

□ 


Regard now a mapping / : (X, O) {Z, 7^) where 7^ is the right order topology induced 
by some total order < on Z. If Z has a minimum and a maximum we can similar regard 
the topological coercivity of / towards maxX' as continuity at “infinity”; 


Theorem 2.4.21. Let (X, O) be a topological space and [Z, <) a totally ordered set which 
has a minimum 'z and a maximum z. For a mapping f : X Z and its extension 
f : X^ Z given by 


fix) 


the following are eguivalent: 


f{x) ifxEX 
z if X = CO 


( 2 . 10 ) 


i) f :(X,0]^(Z,Tt) is topological coercive towards {z} = {max<c Z}. 

ii) /:(x,o)oo-(xr^) is continuous at the point oo. 

Proof. Using part ii) of Lemma 2.4.14 and /(oo) = z we obtain 

/ : (X, O) {Z, T<f) is topological coercive towards {z} 
« VX' e XA{^}{Z, T^) 3K e XA{X, O) : f[X\K] c Z\K' 
« VX e U'{z) n%: 3Ke 1CA{X, O) : f[X\K] c Z\(Z\U') 
« VX E If (z) 3Ke XA{X, O) : f[X\K] c X 

« VX E If'(z) 3 Ke XA(X, O) : /[Xoo\X] c [/' 

« VX e If'(z) 3U E If(oo) : f[U] c X 

/ : (X, C>)oo {Z, T<:) is continuous at the point oo. 


□ 
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2.4.4 Topological coercivity and boundedness below 

In this subsection we deal with the relations between one global and two local bound¬ 
edness notions and give a sufficient criteria when local boundedness implies the global 
boundedness, cf. also [4, p. 240f]. 

We hrst give the dehnitions of the mentioned boundedness notions. 

Definition 2.4.22. Let f : X ^ Z be a genuine mapping from a topologieal spaee {X,0) 
to some totally ordered set {Z, ^). We eall f bounded below, if there is some z e Z sueh 
that f{x) ^ z for all x e X. We eall f locally bounded below, iff every point xq e X 
has a neighborhood U e U{xq) where f\u is bounded below; i.e. - more formally expressed 

-^jf 

Vxo eX 3Ue U{x) 3zeZ ^x e U : f{x) ^ z 

holds true. Similarly, we eall f compactly bounded below, iff f is bounded below on 
every eompaet subset of X; i.e. - more formally expressed - iff 

yx E IC{X, O) 3 zeZ yxEK : f{x) ^ z 

holds true. 

The next proposition shows relations between these boundedness notions. Note therein 
that the relation between "locally bounded below" and "compactly bounded below" is 
similar to the relation between the notions "locally uniform convergence" and "compactly 
(uniform) convergence"; Local boundedness below always implies compact boundedness 
below; in locally compact spaces the two notions even coincide. Note further that all three 
boundedness notions for a mapping / : (X, O) {Z, <) coincide if / : (X, O) {Z, %,) 
is topological strongly coercive towards MAX^(Z). 

Proposition 2.4.23. The different boundedness notions for a funetion f : (X, O) —> {Z, <) 
between a topologieal spaee and a totally ordered spaee are related as follows: 

f bounded below 
f loeally bounded below 

(X5O) loc. comp. 

f eompaetly bounded below 

Proof. Clearly boundedness below implies locally boundedness below. Next, let / : X —» Z 
be locally bounded below. For every x e X there is then some - without loss of generality 
open - neighborhood of x and some z^ ^ Z such that 

fix) ^ 


"(Z.T^) (op, str. coerc. t.jw. MAXp((?) 


37 




2. Coercivity and lower semicontinuity from the topological point of view 


for all X E Ux- Let now K be some - without loss of generality nonempty - compact subset 
of X. Clearly the sets Ux, x e K form an open covering of K. By the compactness of K 
there are finitely many xi,X2, ■ ■ ■ xj^ e K with 

N 

U U„ 3 K. 

n=l 

Setting z := min{zx^, Zx^, ■ ■ ■, hence get f{x) ^ ^ for all x e K, so that / is indeed 

compactly bounded below. 

Assume now that (X, O) is additionally locally compact and let to the contrary / be 
compactly bounded below. Every xq e X has some compact neighborhood K —■ U. For 
this compact set there is some z e Z such that f{x) ^ for all x e K = U. Thus / 
is locally bounded below. Finally we consider a mapping / : {X,0) {Z,T^) which is 

topological strongly coercive towards MAX^(Z) and show that / : (X,0) -> (Z,^) is 
already bounded below if it is compactly bounded below. Assuming the latter we reason 
dependent on the cardinality of Z. If Z contains at most one element then / anyway is 
bounded below. Otherwise we choose any z' e Z\MAX^(Z) and consider the set 

K' ■■= {zE Z : z z'}. 

The set K' is a compact subset of {Z,T^) by Detail 4 in the Appendix. Therefore and 
since / : {X,0) -> (Z, 7j>) is topological strongly coercive towards MAX^(Z) there is a 
compact set K e /C(X, O) with f[X\K] c Z\K', i.e. 

f{x) > z! for all X e X\K. 

Moreover the compactly lower bounded function / : (X, O') —> {Z, <) is bounded below on 
K, i.e. there is a z" e Z such that 

f{x) ^ z' for all X E K. 

Summarizing we have /(x) ^ min{z',z"] for all a; e X, so that / is indeed bounded 
below. □ 


2.5 The topological space ([—oo,+oo], 7^) 

In subsections 

• 2.5.1 A topology on [ — oo, +oo] suited for lower semicontinuous functions 

• 2.5.2 Properties of the topological space ([ — oo,+oo],r) 

• 2.5.3 Known properties of lower semicontinuous functions revisited 
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• 2.5.4 Coercivity properties versus continuity properties 

• 2.5.5 Continuous arithmetic operations in ([— oo,+oo],7l 

we equip [— oo,+oo] with the right order topology T = 7^, study some properties of the 
resulting topological space ([— oo,+oo], T), allowing us to see known properties of lower 
semi continuous functions in a topological light, show that coercivity can be regarded as 
continuity, and that there is a continuous addition on [— oo, +oo] if the topology T is 
installed on [— oo, +oo]. 

A key role for establishing a - as far as the author knows - new topological method for 
proving lower semicontinuity plus coercivity of a function is due to Theorem 2.5.16, which 
allows us to replace the task of proving the lower semicontinuity and coercivity of a function 
h : M” —>■ [—00, +oo] by the task of showing that h admits a certain continuous extension. 

2.5.1 A topology on [— oo,+oo] suited for lower semicontinuous 
functions 

In this subsection we search for a topology T for the interval [— oo, +oo] which is suited 
when dealing with lower semicontinuous functions. 

Definition 2.5.1. A function / : M"" —>■ [—oo, +oo] is called lower semicontinuous or 
Isc, iff it has one of the following equivalent properties: 

• ^x,Xi,X 2 ,X:i,-■ ■ eW ■. Xi ^ X /(x) < liminb^+ 00 /(x«), 

• /-[[ —00, a]] is closed for all a e (— oo, +oo). 

These conditions are really equivalent, cf. [19, Theorem 7.1]. 

We start with a consideration which will lead us to the dehnition of our topology for 
[-00, +oo]. 

Let / : R” —>■ [—oo, +oo] be a function. Referring to the natural topology of R'^, when 
speaking about “open” and “closed” sets, we have 

/ is Isc f^[[ —00, a]] is closed for all a e (— oo, +oo) (2-11) 

/^[(a, +oo]] is open for all a e (— 00,+oo). (2.12) 


Agreement. In the rest of this thesis the interval [—oo, +oo] will - unless otherwise stated 
- be equipped with the topology created by taking the above sets {a, +oo],q; e (—oo, +oo) as 
subbasis, i.e. with the topology 

T ■■= {0, [-00, +oo], {a, +oo] : a e (-oo, +oo)}. 
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which is the right order topology 7^ for the inf-complete, totally ordered space ([—oo, +oo], <), 
cf. Remark 2.4-11- Only in a few situations we will equip [—oo, +oo] with the “just opposite” 
topology 

% = { 0 , [-00, +oo], [-00, /?) : /3 e (-oo, +oo)}. 

By equivalence (2.12) a function / : —>■ [—oo, +oo] is lower semicontinuous if and only 

if the preimages of all sets (q;,+oo],q; e (—oo,+oo), are open sets. Since the intervals 
(a, +oo], q; e (— 00 , +oo) form a subbasis of T we further have 

/^[(a, +oo]] is open for all a e (—oo, +oo) 
f^[T] is open for all T e T 
/ ; (R”, —>■ ([—00, +oo], T) is continuous. 

In summary we obtain the following theorem, cf. [10, Examples II - 2.3 (3)] 

Theorem 2.5.2. For a mapping f : R” —>■ [—oo, +oo] the following are equivalent: 

i) / : R'^ ^ [—00, +oo] is lower semicontinuous, 

ii) / : (R"^, 0 ®”) —> ([—00,+oo], T) is continuous. 

By this theorem the notion of lower semicontinuity can be extended to a broader class of 
functions, while staying consistent with the dehnition for functions / : R'^ —>■ [—oo, +oo]. 

Definition 2.5.3. Let a set X be endowed with some topology Ox- A mapping f \ X 
[—00,+oo] is called lower semicontinuous iff f : {X,Ox) ([— oo,+oo],r) is continu¬ 
ous. 

The topology T on [—oo, +oo] does not only allow to regard the notion of lower semi conti¬ 
nuity as continuity; also the notion of coercivity can be viewed as continuity property, see 
Theorem 2.5.16. 

2.5.2 Properties of the topological space ([— oo,+oo], 7^) 

The topology T is not induced by a metric on [—oo, -i-oo] since otherwise every two dis¬ 
tinct points would have non-overlapping neighborhoods, but this is obviously not the case; 
consider for example the points Xi = 1 and X 2 = 2 and any two neighborhoods Ni and N 2 
of Xi and X 2 , respectively - the intersection A^i n iV 2 ^ [2, -l-oo] is not empty. Only by this 
fact that ([—00, -1-00], T) is not a Hausdorff space the following phenomena are possible: 

i) A sequence {yk)km in ([—+00], T) can have several limit points at the same time. 
In particular, —00 is a limit point of any sequence {yk)ken in ([—go, -|-oo],T). 
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ii) The space ([—oo, +oo],T) contains compact subsets that are not closed. 

Illustrations of these phenomena can be found in Example 2.5.4 and Example 2.5.7, 
respectively. Phenomena i) is completely explained by Theorem 2.5.5. 

Example 2.5.4. Consider the constant sequence {yn)nen = (l)neN Ihe topological space 
([— 00 , +oo], T). On the one hand every y e (1, +oo] is not a T -limit point of {yn); indeed, 
the neighborhood U ■= {z, +oo] ofy, where z is any point between 1 and y, does not contain 
even one single sequence member. On the other hand every y e [—oo, 1] is a T-limit point 
of (yn); indeed, any neighborhood ofy contains the set [y, +oo] and hence even all sequence 
members. 

More generally we have the following theorem; 

Theorem 2.5.5 (Limits of sequences in ([—oo,+oo], T)). Let (|/„)neN be a sequence in 
[—oo,+oo]. A point y e [—oo,+oo] belongs to T-hm„^+ool/n? iff y ^ liminfn->+oo?/n- In 
particular the point —oo is T-limit point of every sequence in ([—oo, +oo], T). 

Proof. Consider Erst the case y = —oo. Then clearly y ^ lim inf„^+oo |/n and also y e 
T-lim„_^_i_oo because the only T-neighborhood of y = —oo is [—oo, +oo] which contains 
trivially all yn- Hence the claimed equivalence holds true in this case. In the other case 
y E (— 00 , +oo] we have y f T-lim„^oo dn iff there is some neighborhood (a, +oo] of y where 
a E (—oo,y) such that yn f {a, +oo] for inhnitely many n e N, i.e. iff lim inf„^+oo |/n < y 
holds true. So the claimed equivalence holds true also in that case. □ 

Theorem 2.5.6 (Compact subspaces of ([—oo,+oo],T)). For nonempty subsets K c 
[— 00 , +oo] the following are equivalent: 

i) {K, K 1T\T) is a compact subspace of ([—oo, +oo], T). 

ii) inf K belongs to K. 

In particular the whole space ([—oo, +oo],T) is compact. 

Before proving this theorem we give an example that shows that the space ([— oo,+oo],r) 
has compact subsets which are not closed. It also illustrates that - in contrast to the 
inhmum - the supremum of compact subsets of ([—oo, +oo], T) needs not to belong to the 
compact set. 

Example 2.5.7. Consider the set K ■= [0,1). {K,K (fil T) is compact by Theorem 2.5.6; 
yet K is not a closed subset of ([—oo, +oo],T), since its complement [—oo,0) u [1, +oo] is 
obviously not an open set from T. Furthermore K does clearly not contain its supremum 

1 . 


41 




2 . Coercivity and lower semicontinuity from the topological point of view 


This examples and part i) of Lemma 2.4.14 shows .4,([— 00 , +oo],T) c /C([-oo,+oo],T). 
Such a relation can never be true in Hausdorff spaces (X,0), where we rather have 
A{X,T) 3 IC{X,T), due to part ii) of Theorem 2.1.1 or even A{X,T) 3 X{X,T) if 
the space (X, O) is not compact. 

Proof of Theorem 2.5.6. Let {K, X (fil T) be any nonempty compact subspace and let k e 
[—00, +oo] denote the inhmum of K. In the first case k = +co the nonemptiness of K 
yields K = {+oo} and thus k s K. In the second case k = —co we must have k e K, since 
otherwise the sets {z, +oo] e T, z e {—1, —2, —3,... } would form an open covering of K 
which can not be reduced to a hnite subcover; so K would not be compact. In the hnal 
third case A: e R we similarly must have k e K since otherwise the sets {A: + ^}, n e N 
would form an open covering of K which has no hnite subcover. 

Let, to the contrary, K now be a nonempty subset of [—oo, +oo] with k ■= inf K e K and 
let (Tj)jg 7 be an open covering of K with sets Ti from T. Due to 

kEK<^[jTi 

iGl 


there is an 1^, e I with k e Ti^. With this open set 

T^. e r\{ 0 } 

= {[-00, +oo], {a, +oo] : a e (-oo, +oo]} 

we already have found a hnite subcover, because ^ [k, +oo] ^ K. So {K, iL fTil T) is a 
compact subspace of ([— oo,+oo],r). 

Note hnally that [—oo,+oo] contains its inhmum —oo, so that ([—oo,+oo], T) is compact 
by the already proven equivalence. □ 

In the subsequent subsection we will use Theorem 2.5.2 and Theorem 2.5.6 to give a topo¬ 
logical proof of the known results that the composition of a continuous function with a 
lower semicontinuous function is again lower semicontinuous and that a lower semicontin- 
uous function takes its inhmum on any nonempty compact set, respectively. 

2.5.3 Known properties of lower semicontinuous functions 
revisited 

In this subsection we revisit known properties of lower semicontinous functions. We 
will see that these properties stem from Theorem 2.5.2 and the properties of the space 
([— 00 , -i-oo], T). The property we start with is the fact that every composition g o f of a 
continuous mapping / with some lower semicontinuous mapping g is lower semicontinous, 
cf. [20, 1.40 Exercise]. 
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Theorem 2.5.8. Let {X,Ox) and (Y,Oy) be topological spaces, f : (X,Ox) (Y,Oy) 
be a continuous mapping and g : Y —>■ [—oo, +oo] be a lower semicontinous mapping. Then 
the concatenation h ■= g o f : X ^ [—+oo] is again lower semicontinous. 

Proof. The mappings / : {X,Ox) {Y,Oy) and g : {Y,Oy) —* ([—oo,+oo],T) are 
continuous by assumption and by definition, respectively. Hence their concatenation h = 
gof : {X,Ox) - ([ —oo,+oo],T) is again continuous, i.e. h : X [— oo,+oo] is lower 
semicontinous. □ 

Phenomenon i) in Subsection 2.5.2 said that a sequence {yk)kefi in ([“OO, +°C)]; can have 
several limit points at the same time and that — oo is always a limit point. The first part of 
this phenomenon is reflected also in the fact that lower semicontinous functions defined on 
punctured R” can be usually continued in many ways to a lower semicontinous function on 
whole R”, see Example 2.5.9. The second part of this phenomenon is reflected in the fact 
that a function / : {X,0) —>■ ([—oo,+oo], T) is automatically continuous in all preimage 
points of {—oo}, see Lemma 2.5.10. 

Example 2.5.9. Consider the function f : R\{0} —>■ [—oo,+oo], given by f{x) ■= 1. 
Setting /(O) := c with any c e [—oo, 1] we obtain a lower semicontinous function / : R —> 
[-00, +oo]. 


The following lemma is directly obtained as special case of Proposition 2.4.7. 

Lemma 2.5.10. Let {X,0) be a topological space and f : {X,0) —>■ ([— oo,+oo],r) a 
mapping. For every x e X we have 

f{x) = —00 / is continuous in x. 

Proof. Let x e X he a point with f{x) = —oo. For each neighborhood U of x we trivially 
have f[U] c [—oo, +oo]. Since [—oo, +oo] is the only existing neighborhood of —oo = f{x), 
this inclusion already shows that / is continuous in x. □ 

The following theorem says that a lower semicontinous function attains a minimum on 
every nonempty compact subset, cf. [20, 1.10 Corollary]. 

Theorem 2.5.11. Let {X,Ox) be a topological space and f : X [—oo,+oo] be lower 
semicontinous. Then f attains its infimum on any nonempty compact subset of {X,Ox)- 

Proof. The mapping / : (X,Ox) +'^]) "T”) is continuous by Definition 2.5.3. 

Hence every nonempty compact subset K of {X, Ox) is mapped by / to a compact subset 
of ([ —oo,+oo],T). This again compact image f[K] contains its infimum by Theorem 
2.5.6. □ 
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By Theorem 2.5.11 a lower semicontinuous function f : X [—oo,+oo] on a topological 
space {X, Ox) takes its minima on every nonempty compact subset of this space. However 
/ does not need to takes maxima on nonempty compact subsets as the following example 
shows. 


Example 2.5.12. The function / : M ^ [—oo, +oo] given by 


fix) 


— |x| for X ^ 0, 

— 1 for X = 0, 


is a lower semicontinous function that does not attain its supremum 0 = sup,j,g|-_]^ f{x) 
on the compact subset [—1,1] of (M, O). 


We conclude this subsection by giving a table with some properties of the topological 
space ([—00,+oo],T) and corresponding properties of lower semicontinuous functions, i.e. 
continuous functions (X,0) —>■ ([— oo,+oo],r). 


Space ([— 00 ,+oo],T) 

Function / : {X, O) ([—oo, -l-oo], T) 

cf. 

A seq. (|/fc)fceN can 

have many limit points: 
y e T-lim yt 

fc—> + 00 

[- 00 ,?/] C T-limfc_+oo ?/fc 

Making a function value /(a^o) 

smaller preserves lower semicontinuity; 

J(x) yields still 

[ c if a; = xo 

continuous mapping / : (X, O) 

([- 00 , +oo], T) for c e [-oo, /(xq)] 

Thm. 2.5.5 & 
Ex. 2.5.9 

—00 e T-lim for 

fc— >+00 

all sequences iyk)ken in 

[-00, +GO]. 

/(xq) = —00 / cont. in Xq 

Thm. 2.5.5 & 
Lem. 2.5.10 

K' c [— 00 , -i-oo] is com¬ 
pact inf K' e K' 

/ takes a minimum on every compact set 
K ^X 

Thm. 2.5.6 & 
Thm. 2.5.11 


2.5.4 Coercivity properties versus continuity properties 

In this subsection we dehne the notion of coercivity for functions f : MT ^ +oo] 

and see that / is coercive and lower semicontinuous iff extending / to the one point 
compactihcation of M"" by setting /(oo) := +oo yields a continuous mapping / : —>■ 

[— 00 , +oo], see Theorem 2.5.16. This equivalence is the key for a - as far as the author 
knows - new technique for proving coercivity plus lower semicontinuity. See Section 2.6 
and Section 2.7 for more details. 

We also dehne the notion of normcoercivity for mappings / : MT MX and will see that 
this property is again equivalent to a continuity property of some continuation of / to the 
one point compactihcation of M^, see Theorem 2.5.18. 

We start with giving the dehnitions. 
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Definition 2.5.13. A function / : M"" —> [— 00 , + 00 ] is called coercive, iff 

f{x) —>■ +00 for ||a;|| —>■ +00 

A related coercivity notion is given in [6, Definition 1.12], cf. also [6, Example 1.14], For 
the next definition cf. [8, p. 134]. 

Definition 2.5.14. A mapping / : M"" —>■ is called normcoercive, iff 

||/(a;)|| —>■ +00 for ]|x]] —>■ +00 


For a mapping / : ^ R we can speak both of coercivity and normcoercivity. Clearly 

coercivity implies normcoercivity. The contrary holds not true as the following example 
shows; 

Example 2.5.15. The function / : R R given by f{x) ■= x is clearly normcoercive. 
Considering the seguence of the numbers x^ ■= —k for x e N we have \xk\ —>■ +00 as 
k —>■ +00 but f{xk) = —k —00 ^ +00 as k ^ +00 so that f is not coercive. 


The following theorems show that coercivity properties of functions correspond to conti¬ 
nuity properties of special continuations of them - anticipating a name from Section 2.6 
- more precisely of special compact continuations of them. The order topology for the 
interval [— 00 , -l-oo] is denoted by (9^, cf. Definition 2.4.4. 


Theorem 2.5.16. A mapping f : R'^ —>■ 

/ : +“]. given by f{x) := | 

relations: 


[— 00 ,- 1 - 00 ] and its continuation 


f{x), ifxeW^ 
- 1 - 00 , if X = CO 


, are connected by the following 


/ : R” —> [— 00 , - 1 - 00 ] is coercive 

f : (R”,C)'^"') —> ([— 00 ,- 1 - 00 ], T) is topological coercive towards {+ 00 } 
/ : (R^, O^) —>■ ([— 00 , - 1 - 00 ], T) is continuous at the point 00 e R” 
— 00 ,- 1 - 00 ], is continuous at the point 00 . 


/ : R” —>■ [— 00 , - 1 - 00 ] is lower semicontinous and coercive 
f : —>■ ([— 00 ,- 1 - 00 ], T) is continuous. 
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Before proving this theorem we give an example to illustrate part ii). O denotes again the 
natural topology on M. 

Example 2.5.17. The function / : M —>■ [—oo, +oo], given by 

fix) ■■= X 


is lower semicontinous but not coercive. In accordance to part ii) of Theorem 2.5.16 its 
continuation f : (Moo,C>oo) ^ ([“OC))+oo]) given by 


fix) 


fix) = X if X eR 
+00 if X = CO, 


is not continuous; more precise f is not continuous in the newly added point oo since 
there is no compact subset K of R such that a /[Moo\-^] is contained in the neighborhood 
(3, +oo] =: U of +00 for the following reason: Any compact subst K ofRis bounded and 
hence contained in some interval [—A^, A^] with some N e N. Hence the image /[M co\A'] ^ 
/[Moo\[—A^, A^]] = i—cc,—N) u (A^,+oo] ^ {—co,—N) is not completely contained in 
U = (3, +oo]. 


Proof of Theorem 2.5.16. 
i) We have 

/ is coercive 

fix) —> +00 for ||x|| —>■ +00 
Va e M > 0 Vx e M” : ||x|| > fix) > a 
Va e M 3R>0: /[M’"\1k(0)] c («, +oo] 

Va e M 3Ke /C^(R^) : /[M”\A:] c (a, +oo] 
yU e U'i+oo) nT 3 Ke /C^(R^) : /[R”\A:] c U' 

yK' E /C^{+oo}([-oo, +oo], T) 3K E /C^(R^) : /[R”\i^] c [-oo, +oo]\K' 

/ : (R"^, O^) —>■ ([— 00 , +oo], T) is topological coercive towards {+oo}. 

Explanations for the equivalences in (*) and (o) are given in Detail 5 in the Appendix. 
So we have proved the hrst of the claimed three equivalences. The second of the claimed 
equivalences is just a special case of Theorem 2.4.21. Finally the third of the claimed 
equivalences holds true since the system {T e T : +oo e T} of open T-neighborhoods 
of +00 is both a T-neighborhood basis of +oo and an (T,g-neighborhood basis for +oo; a 
detailed proof of the third equivalence can be found in Detail 6 in the Appendix. 
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ii) With Theorem 2.5.2 and part i) we get 



/ 

/ 

f 

f 

f 


is Isc and coercive 


(R”,C)®”) 

([-oo,+oo],r) 

(K”.or)- 

([-oo,+oo],r) 

(K”.or)- 

([-oo,+oo],r) 

K.or)-- 

([-oo,+oo],r) 


is continuous in every a; e M"" 
and coercive. 

is continuous in every x e M"' 
and / is coercive, 
is continuous in every x e M"' 
and in a; = 00 . 
is continuous. 


□ 


Similarly we have the following theorem. 

Theorem 2.5.18. For a mapping f : M"" —>■ and its continuation f : ^ M™, where 

/(oo) := 00 , the following are equivalent: 

i) / : M” —>■ is normcoercive. 

ii) / : (M”, (T®"') —>■ (M™, (T®™) is topological coercive. 

iii) / : (M” , O®"") ^ (M™, is continuous m oo e M” . 


Proof. Similar to the proof of part i) in Theorem 2.5.16 we obtain 


/ is normcoercive 
ll/(^)ll for ||a;|| —>■ +oo 

Vr e M > 0 Vx e M” : ||x|| > ll/(3^)ll > ^ 

« Vr e M 3R>0: /[M”\l^j(0)] c R"^\1^(0) 

& Vr e M 3Ke /C(M”) : /[M”\h:] c R"^\1,(0) 

« VhT' e }C(W^) 3K e /C(R”) : /[M'^\h:] c R"^\h:' 

/ : (R”, O®”) —> (R™-, 0 ®™-) is topological coercive. 

For the equivalence (*) cf. Detail 5. So the equivalence of the hrst two statements from 
Theorem 2.5.18 is proved. The equivalence of the second and the third statement is just a 
special case of Theorem 2.4.20. □ 
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2.5.5 Continuous arithmetic operations in ([— oo,+oo], 7^) 

In this subsection we consider addition and multiplication on [—oo, +oo]. In Theorem 2.5.20 
we show that there is a continuous addition + : ([—oo,+oo]^, T'^^) ^ ([—oo,+oo], T) on 
([—00, +oo], T). This is remarkable, since there is no continuous addition on the topological 
space ([—00,+oo], (T^), no matter which value from [—oo,+oo] we choose for the critical 
—oo + (+oo). Regarding multiplication, however, things are more complicated. For instance 
we will see in Theorem 2.5.22 that multiplication with A e (0, +oo) is continuous, whereas 
multiplication with A e (—oo, 0) is not continuous - but we should rather be happy about 
that; The just mentioned properties of the multiplication £t namely to the facts that 
multiplying a lower semicontinuous function with some A e (0, +oo) gives again a lower 
semicontinuous function, whereas multiplying with A e (—oo, 0) can result in a non lower 
semicontinuous function; 


Example 2.5.19. Consider the function / ; M —» M given by 


fix) 


3 for X < 0 
2 for X ^ 0 


Obviously f is lower semicontinuous (but not upper semicontinuous). Multiplication of f 
with —1 e (— 00 , 0 ) results in the non lower semicontinuous function —f. 


The next theorem shows that there is a continuous addition on [—oo, +oo]. 

Theorem 2.5.20. Continuing the addition on M u {+oo}, by setting +oo + (—oo) := —oo 
and —00 + (+oo) := —oo, we get a continuous function 

+ : ([- 00 , +oo] X [- 00 , +oo], T ® T) ^ ([-oo, +oo], T). 

Setting +oo + (—oo) or —oo + (+oo) not to —oo, but to any other value c e (—oo, +oo] would 
result in a non-continuous mapping. 


Proof. We set —oo+(+oo) and +oo+(—oo) to some values c, d e [—oo, +oo], respectively, and 
ask if the thereby extended addition + ; ([— 00, +oo] X [-00, +oo], T ®T) -> ([-00, +oo], T) 
can be continuous at all in the points (—oo; +oo) e [—oo, +oo] x [—oo, +oo] and (+oo; —oo) e 
[— 00 , +oo] X [— 00 , +oo], respectively. We deal first with the point (—oo; +oo) e [—oo, +oo] x 
[— 00 , +oo] and consider the local mapping behavior of our extended addition near this 
point. To this end note that any neighborhood U of that point contains a subset of the form 
[— 00 , +oo] X [a, +oo], where a e [—oo, +oo), and therefore is mapped to +[U] = [—oo, +oo] 
all the more. So we can achieve continuity in the point (—oo; +oo) e [—oo, +oo] x [—oo, +oo] 
only by choosing a value c whose only neighborhood is [—oo, +oo]; clearly only c = — oo 
meets that demand. Analogously, setting d = —oo is the only chance to get an extended 
addition, which is continuous in the point (+oo; —oo) e [—oo, +oo] x [—oo, +oo]. 
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Now we prove that setting —oo + (+oo) := —oo and +oo + (— 00 ) := —00 really yields a 
continuous mapping 

+ : ([- 00 , + 00 ] X [- 00 , + 00 ], T ®T) ^ ([-oo, +°o],T) 

To this end we show that all preimages 

+ [(c, + 00 ]] = {(fli, ( 22 ) ^ + 00 ]^ : ai + 02 > c} 

= {(o-i, 02 ) ^ + 00 ]^ : oi + 0.2 ^ c} =: Ac 

of the subbasis forming sets (c, + 00 ], c e [— 00 , + 00 ) are again open sets. For this purpose 
we show that every (oi, 02 ) e Ac is an interior point of A^ i.e. that there are neighborhoods 
(oi, + 00 ] of oi and ( 02 , + 00 ] of 02 with 

Too], 62 ^ (® 2 ) Too] ! + 62 T C. 


In the first case 01,02 e M we can choose oi := oi — ^((oi + 02 ) — c) < oi and 02 := 
02 — |((ai T 02 ) — c) < 02 - In the second case 01=02 = Too the job is done by oi := | 
and 02 := In the third case Oi = +00 and 02 e M we can choose any real 02 < 02 and 
then set Oi := 0 — 0,2 < Too. The remaining forth case Oi e M and 02 = Too = Oi can be 
handled analogously by switching roles. □ 


Next we will consider multiplication. We start with the following lemma which allows to 
transfer some of our results about addition to multiplication. 

Lemma 2.5.21. Extending the usual exponential function x ^ e^ via e~^^ := +00 and 
e“°° := 0 gives a homeomorphism ([—00, +00], T) ^ ([0, Too], [0, +00] (fil T). It translates 
the, by means of +00 + (—00) = —00 + (+00) = —00, extended addition into the, by means 
of 0 ■ (too) = +00 -0 = 0, extended multiplication; namely in virtue of 

exp(a:i T 0 : 2 ) = exp(xi) ■ exp( 0 : 2 ) 


for all Xi,X 2 e [— 00 , + 00 ]. 


Proof. Since the extended exponential function is an order isomorphism between the totally 
ordered sets ([— 00 , + 00 ], <) and ([0, Too], < |[o,+oo]x[o,+oo]) we know, by Theorem 2.4.8, that 

exp : ([- 00 , + 00 ], T) ^ ([0, + 00 ], [0, + 00 ] fRl T); 

is an homeomorphism; note here that the subspace topology [0, + 00 ] (fil T is the same as 
the right order topology on [0, Too], generated by < |[o,+oo]x[o,+oo]- D 
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The following theorem deals in its hrst block with multiplication on [0, +oo] and with 
multiplication on [— oo,+oo]. Since the results for the latter are not as satisfying as the 
results in Theorem 2.5.20 we moreover deal in a second block with multiplication 

mx : ([- 00 ,+oo], T) ([-oo,+oo], T), mx{x) ■= Xx 

by a factor X e [— oo, +oo], distinguishing the cases A e (0, +oo), A = 0, A e (—oo, 0), A = 
+00 and A = — oo. We will see that the continuity properties of mx depend heavily on A. For 
instance the following holds true for the mapping mx : ([— oo,+oo],r) ^ ([-oo,+oo],r). 

• For A e (0, +oo) it is a homeomorphism and hence in particular continuous. 

• For A e (—oo, 0) it is discontinuous in every point of [— oo, +oo). 

More precisely we have the following statements. 

Theorem 2.5.22. Considering multiplication as function of two variables the following 
statements hold true: 

i) Continuing the multiplication of non-negative numbers, by setting the problematic 
cases 0 ■ (+oo) := 0 and (+oo) • 0 := 0, we get a continuous function 

■ : ([0, +oo] X [0, +oo], ([0, +oo] x [0, +oo]) ffll (T ® T)) ([0, +oo], [0, +oo] fnl T) 

Setting 0- (+oo) or (+oo) • 0 not to 0, but to any other value d e (0, +oo], would result 
in a non-continuous mapping. 

ii) Continuing the multiplication on M, by setting each of the problematic cases 0 ■ 
(+oo), (+oo) • 0 and 0 • (— oo), (— oo) • 0 to any four values from [— oo,+oo], we get 
a function which is continuous in a point x e [— oo, +oo] x [— oo, +oo], iff 

X e [x e [—00, +oo] X [—00, +oo] : xi > 0 and X2 > 0} 
u{a; e [— oo, +oo] x [— oo, +oo] xi ■ X 2 = — oo}. 

For multiplication by a constant factor the following statements hold true: 

iii) The multiplication mx : x ^ Xx by a factor X e (0, +oo) is a homeomorphism 

mx : ([- 00 ,+oo],T) ([-oo,+oo],T) 

and thus in particular continuous. 

iv) If we agree 0 ■ x = x ■ 0 = 0 also for x = —co and x = +oo then the multiplication by 
0 is also a continuous mapping 


mo : ([-oo,+oo],r) ^ ([-oo,+oo], T). 
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v) The multiplication m\ \ x ^ \x with A e (— 00 , 0) is a mapping 

mx : ([-oo,+oo],r) ^ ([- 00 ,+ 00 ], T), 

which is discontinuous in each point [— 00 , + 00 ); the point +00 is the only one where 
this mapping is continuous. 

vi) Extend the multiplication with +00 by setting the problematic (+ 00 ) • 0 to some value 
c E [— 00 , + 00 ]. This extended multiplication 

m +00 : ([- 00 , +oo],T) ^ ([- 00 , +oo],T) 

+00 for a; > 0 
X t—> (+ 00 ) ■ X ■■= Ic for X = t) 

—00 for a; < 0 

with the factor +00 is then continuous in all x > 0 and in all x < 0. In the point 0 
it is continuous, iff we have set c = — 00 . 

vii) Extend the multiplication with —00 by setting the problematic (— 00 ) • 0 to some value 
c E [— 00 , + 00 ]. The, in this way, extended multiplication 

m_oo : ([- 00 , +oo],T) ^ ([- 00 , +oo],T) 

—00 for a; > 0 
X t—> (— 00 ) ■ X ■■= I c for a; = 0 
^ +00 for a; < 0 

with the factor —00 is then continuous in all x > 0, discontinuous in all x < 0. In 0 
it is continuous, iff c = — 00 . 

Proof, i) With the help of the homeomorphism exp from Lemma 2.5.21 and its higher 
dimensional relative 

([- 00 , + 00 ] X [- 00 , + 00 ], T ® T) ^ ([0, + 00 ] X [0, + 00 ], ([0, + 00 ] X [0, + 00 ]) [Hi (T ® T)) 

{xi,X 2 ) ^ (exp(a;i),exp(a; 2 )) 

we can translate our knowledge from Theorem 2.5.20 about the addition to the current i), 
since those homeomorphisms yield a bijection P between 

C (([-GC),+oo],T)®^([-oo,+oo],r)) 

c(([o,+»],r)®^([o,+»],r)), 

namely via /?(/) ;= gf, where gf{yi,y 2 ) ■= exp (/(exp"^(yi), exp'i(y 2 ))) • Choosing for / 
the extended addition from Theorem 2.5.20 we see that the continuous mapping P{+) is 
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just our, by means of 0 ■ +oo := 0 and +oo • 0 := 0, extended multiplication. It remains to 
show the uniqueness of that extension; assume that there is another continuous extension 
of the multiplication with, say, d = 0 ■' (+oo) e (0, +oo]. Then —■ +' would be a 

continuous extension of the addition with 

—00 +' (+oo) = exp~^(0 ■' (+oo)) = exp~^(d) ^ —oo. 

But such a continuous extension of the addition does not exist by Theorem 2.5.20. 

ii) We show that the (extended) multiplication is continuous in point x with Xi,X 2 > 0. 
To this end let (/?, +oo],/3 e [—oo,a;i • X 2 ) be some neighborhood of xi ■ X 2 > 0. Choose 
any xi,X 2 > 0 with xi < xi and X 2 < X 2 i jd < xi ■ X 2 . Then ('ri,+oo] x ('r 2 ,+°o] is a 
neighborhood of x which is mapped by the (extended) multiplication into {/d, + 00 ]. This 
shows the continuity in x. 

It remains to show that the (extended) multiplication is continuous in a point 

x e ([— 00 , + 00 ] X [— 00 , +oo])\{(xi, 0 : 2 ) : > 0 and X 2 > 0}, 

iff Xi- X 2 = — 00 . Assume that x e ([—go, + 00 ] x [— 00 , +oo])\{(xi, X 2 ) : > 0 and X 2 > 0}. 

Due to the commutativity of the multiplication we may assume xi ^ X 2 , without loss of 
generality, so that we have xi < 0. Since any neighborhood U of x contains a subset of the 
form (xi, + 00 ] X (x 2 , + 00 ] with xi < 0 and X 2 < X 2 we see that -[U] = [— 00 , + 00 ]. Since 
the multiplication • has to be continuous in x the latter equation means that [— 00 , + 00 ] 
must be the only neighborhood of Xi ■ X 2 ', This is only the case if Xi ■ X 2 = —co. Finally 
Lemma 2.5.10 assures that the multiplication is continuous in points x with xi- X 2 = —co. 

iii) The multiplication by a constant factor A e (0, + 00 ) is an order automorphism of 
([— 00 ,+ 00 ], <). Because of part iii) of Theorem 2.4.8 it is therefore a homeomorphism 

([-oo,+oo],r) ^ ([-oo,+oo],r). 

iv) A constant mapping between topological spaces is continuous. 

v) The continuity in +00 is assured by Lemma 2.5.10. Let now x e [— 00 , + 00 ) and choose 
any X 2 > x. Since Xx 2 < Axi we get the discontinuity of m\ in x by part i) of Theorem 

2.4.8. 

vi) The continuity of m +00 in x < 0 is ensured by Lemma 2.5.10. m +00 is also continuous in a 
point X > 0: Let (/9, + 00 ] be any neighborhood of x. Then U ■= (0, + 00 ] is a neighborhood 
of X which is mapped by m +00 into (/3, + 00 ]. Consider the remaining point x = 0. If we 
had set c = —00 we have continuity in 0 again by Lemma 2.5.10. If we had set c > —00 
we can choose any neighborhood (/3, + 00 ] of c. Since every neighborhood f/ of 0 contains 
an element u < 0 we have —00 e m+oo[C], so that we get m+ao\U] $ (/3,+ 00 ] for all 
neighborhoods U of 0; i.e. m +00 is not continuous in 0. 

vii) The continuity of m_oo in points x > 0 is again ensured by Lemma 2.5.10. Yet in 
every point x < 0 this mapping is not continuous by part i) of Theorem 2.4.8, since for 
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any 0:2 > 0 > a; we have m-oo{x2) < ?ti_oo(x). Consider now the remaining point x = 0 . If 
we had set c = —00 we have continuity in 0 once more by Lemma 2 . 5 . 10 . If c > —00 we 
can just argue as before in the case x < 0 to see that m_oo is not continuous in 0. □ 

2.6 Compact continuations 

In this subsection we will introduce and deal with the notion of compact continuation of 
functions / : {V, O) (V', O') between topological spaces (C, O) and (C', O'). This notion 
is, as far as the author knows, new. 

Due to Theorem 2 . 5.16 the lower semicontinuity and coercivity of a mapping /i : M” —>■ 
[—00, +00] can be proven by checking that a certain extension h : M” —> [—00, +00] of h 
is continuous, i.e., in other words, if the mapping h is a compact continuation of h. If 
h = g o f as in Section 2 . 7 , then the question arises if h has that compact continuation 
provided that both / and g have according compact continuations. An answer to this 
question is given in Theorem 2 . 6 . 2 . 

We remark here that this technique goes beyond the technique of proving coercivity of a 
mapping h : M"" —>■ [—00, +00] by writing it as composition h = g o f oi a normcoercive 
mapping / : M” —>■ and a coercive mapping g : R™' —>■ [—00,+00], since the latter 

technique works only for decompositions of h where the intermediate space is R™', whereas 
the first technique can - at least in principle - work also for decompositions into functions 
/ : R” ^ y and g : Y [—00, +00] where the intermediate space can any topological 
space y, like e.g. the product space ([—00, +00] x [—00, +00], Y®T) in the decomposition 
9 = 92 ° gi i^ Lemma 2 . 7 . 1 . 

However our topological technique has two disadvantages: It can not be used to prove 
coercivity of a non lower semi continuous function and more important; Even if we have 
a straightforward choice of continuing each of the concatenated function in h = 5^ o / to 
functions f \ X —* Y and g Y Z it might still be the case that working with our 
technique could be somewhat cumbersome in cases where Y ^ Y. 

We now define the notion of compact continuation. As far as the author knows this notion 
is new. 

Definition 2.6.1. A continuous mapping f : (V, O) (H', O') between topological spaces 
{y,0), (y', O') is called compactly continuable if there is a continuation 

f : (y, O) - (W, O') 


which fulfills: 

i) (y, O) is a compact topological space which contains (V, O) as subspace, 

ii) (y',0') is a topological space that contains {V, O') as subspace, 
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iii) / is continuous and fulfills f{v) = f{v) for all v s V. 

Each such continuation f will be called compact continuation of f. If f fulfills in 
addition f[V\V] c y'\y' we call f a home leaving compact continuation of f. 

Theorem 2.6.2. Assume that the two continuous mappings f : {X,Ox) {Y,Oy), 
g : (Y, Oy) {Z, Oz) have compact continuations 

f:iX,Oy)-^iY,Oy), 

g:{Y, 0 Y)^{Z, 02 ). 

Then g o f =: h has a compact continuation 

h:{X, 0 y)-^{Z, 02 ) 

if one of the following conditions is fulfilled: 

i) idy ; {Y, Oy) —>■ {Y, Oy) has a compact continuation 
\dY-.{Y,Oy)-^{Y,OY). 

ii) idy : (T, Oy) —>■ {Y, Oy) has a compact continuation 

idy : (Y, Oy) (Y, Oy) which, firstly, glues {Y, Oy) to (Y, Oy) and, secondly, 
fulfills idy(2/i) = idy(?/2) ^ giVi) = 9iy2), for all yi,y2 e Y. 

iii) idy ; {Y, Oy) —>• (Y, Oy) has a surjective compact continuation 

idy : (Y, Oy) —» (y, Oy) where, firstly, (Y, Oy) is a Hausdorjf space and, secondly, 
the condition idy(|/i) = idy(1/2) gidh) = ^(2/2) holds true for all yi,y2 e Y. 

If, in addition to i) or ii) / iii), respectively, both f and (idy or idy, respectively) are 
home leaving compact continuations then h can be chosen such that it fulfills 

h[X\X]^g[Y\Y]. ( 2 . 13 ) 

Before proving the theorem we show by an example that g o f, does not need to have a 
compact continuation (X, Oy) —>■ (Z, Oy), if none of the three conditions from the above 
theorem is fulhlled. O denotes again the natural topology of M. 

Example 2.6.3. Consider three copies 

(X, Ox) = (y, Oy) = (y, Oz) = ((0, 27 r), (0, 27 r) fnl O) 

of the real open interval (0, 27 r) along with the identity mappings f = id(o,2,r) : C>x) —^ 

(y, Oy) and g = id(o,27r) : (y, Oy) — » {Z, Oz) between them. We will extend the equal 
mappings f and g in different ways to compact continuations f : (X, Oy) (Y, Oy) and 
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g : {Y, Oy) (Z, O^) such that h ■= g o f can not be extended to a compact continuation 
h-.{X, 0 ^)-^{Z, 02 ). Let 

?■■= id(o,2.Moo} : (( 0 , 27 r),( 0 , 27 r)fnlO)^- (( 0 , 27 r),( 0 , 27 r)fRlO)^, 
g ■■= id[o,2.] : ([ 0 , 27 r], [ 0 , 27 r] fnl O) - ([ 0 , 27 r], [ 0 , 27 r] fnl O) 

and set 

{X, O^) ■.= {Y, Oy) := (( 0 , 27 r), ( 0 , 27 r) CiO)^, 

{Y,Oy) := {Z, 02 ) := ([ 0 , 27 r],[ 0 , 27 r]^ 0 ). 

The functions f and g are compact continuations of f and g, respectively, but it is not 
possible to extend h ■= g o f = id(o,27r) to a compact continuation h : {X, O^) —* {Z, O2); 
indeed, if such a continuous mapping h existed, it would have to map its compact domain 
of definition to a compact subspace of {Z, O2) = ([ 0 , 27 r], [ 0 , 27 r] fTil O); however 

h{X] = /i[( 0 , 27 r) u {00}] = ( 0 , 27 r) u {^(00)} 

would never be a compact subset of ([ 0 , 27 r], [ 0 , 27 r] fTil ( 9 ) - regardless whether h{co) = 0 , 
h{co) = 271 or h{co) e ( 0 , 27 r). 

So we know by the last theorem that none of the conditions i), ii) and iii) can be fulfilled. 
We nevertheless verify this directly, to complete our illustration of the preceding theorem. 

i) is not fulfilled as we just have shown by proving the nonexistence of a compact contin¬ 
uation h : (X,Oy) —» {Z,02), i-c. of a compact continuation id(0 27r) = h : {Y,Oy) —>■ 
(Y,Oy). 

Furthermore ii) and iii) are not fulfilled, since any continuation o/id(o,27r) : ( 0 , 27 r) ^ ( 0 , 27 r) 
to a mapping id(o,27r) : (0, 27 r) u {0, 27 r} —> (0, 27 r) u {00} is not injective any longer, so that 
there is no chance for the injective mapping g to fulfill g{yi) = ^(2/2) in the occurring case 
that id(o,27r)(?/i) = id(o,27r)(l/2) for distinct points yi, y2 e Y. 

Proof of Theorem 2 . 6 . 2 . If i) holds, it suffices to take h ■■= g o idy o f. 

Assume now that condition ii) holds. The mapping g' : {Y, Oy) —* (Z, O2), given by 

g'(y) '■= “^(idy [{y}])“ == g(y), where y is any element of Y with idy(^) = y 

is well dehned since idy(?/i) = idy(|/2) ensures g{yi) = ^(1/2), for ah yi,y2 e Y. The 
dehnition of g' was done in such a way that g = g' o idy. 

(^y) 

X 

(y, Oy) {Z, O2) 
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This implies, firstly, the continuity of g', in virtue of Theorem 2.3.16, and, secondly, g'{y) = 
g{y) at least for all y e Y. Thus we have found a compact continuation g' : (T, Oy) 

(Z, O 2 ) of g : (Y, Oy) —* (Z, Oz)- The concatenation h ■■= g' o f is the needed extension. 

Finally note that the assumptions in iii) imply the assumptions in ii), in virtue of Theorem 
2.3.14. □ 

Next we deal with a special case of Theorem 2.6.2, where the “intermediate” spaces {Y, Oy), 
(Y,Oy), (Y,Oy) and the continuation g have special forms, which will occur, when ap¬ 
plying the theory to our example in Section 2.7. We start with the following preparatory 
lemma. 

Lemma 2.6.4. For locally compact Hausdorff spaces (Y', O') and {Y", O") the following 
is true: 


i) Both [(W, O') <S) (Y", O")]oo and (W, O')oo' ® (Y", O")oo" are compact Hausdorff spaces 
which contain (Y', O') <S) {Y", O") as subspace. 

ii) An extension of id : (W, O') 0 {Y", O") —» (Y', O') ® (Y", O") to a surjective, home¬ 
leaving compact continuation id : {Y',0')ao' ® {Y",0")ao" [{Y',0') ® {Y",0"))ao 

is given by 

— , I {y', y") , if y' e Y' and y" e Y" 

id{y ,y)--=< 

00 , if y = CO or y = 00 . 


Proof, i) Theorem 2.3.20 ensures that both {Y',0')oo' and (Y",0")coi' are compact Haus¬ 
dorff spaces, which contain {Y',0') and {Y",0"), respectively, as subspace. Therefore 
their product space {Y', O')oo' ® (H", O")oo" is compact - in virtue of Tichonov’s Theorem 
2.3.6 - and contains {Y', O') ® {Y", O") = (Y', Y' fTil O'^,) <S) {Y", Y" ifTl 0'f„) as subspace, 
since Remark 2.3.8 allows the reformulation 

{¥', Y' fl O'^y) ® {Y", Y" ft oy) = (Y' X Y", (Y' x Y") If# (O^, ® OJ,,)). 

By Detail 7 the product space {Y', 0')®{Y", O") of two locally compact Hausdorff spaces is 
again a locally compact Hausdorff space, so that its one-point compactification \{Y', O') ® 
(Y", O")]oo is a compact Hausdorff superspace of (Y', O') ® {Y", O"), by Theorem 2.3.20. 
This show the hrst part of the lemma. 

ii) As core part for proving that id is a surjective, homeleaving compact continuation of id 
we have to show that id is continuous; it easy to see, by id’s dehnition, that it fulhlls the 
remaining properties, we had to show. In order to prove the continuity of id we will use that 
the projections H : (W, O') 0 (F", O") (W , O') and tt" : (W, O') 0 (F", O") (F", O") 

to the hrst and second component, respectively, are continuous and therefore map compact 
subsets of (Y',0') 0 {Y",0") to compact subsets of (Y',0') and (Y",0"), respectively. 
In every point (?/', y") of the open subset Y' x Y" e O'^, 0 O'f,, the mapping id is clearly 
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continuous. It remains to show that id is continuous in all points of the form [Go',y") or 
{y', oo") where y' e Y^, and y" e Y^„. In order to show the continuity in all these preimage 
points of 00 we consider any neighborhood 


V = {rx Y'%\K 


Thm. 2 . 1.1 


of 00 , with arbitrary K e ICA{Y' x Y") 
the set U (h-, X y"„)\(tr'[/f] x _ (I'iV'lA']) x K 


1C{Y' X Y”) and convince ourselves that 
^ X hrstly, 


fulhlls id[f/] = (Y' X F")oQ\(7r'[iC] x 7r"[i^]) c V and, secondly, is an open neighborhood 
of all our preimage points of oo. This shows the second part of the lemma. □ 


Using this Lemma we are now going to prove the announced special case of Theorem 2.6.2; 

Theorem 2.6.5. Let {Y', O') and (U", O") be loeally eompaet Hausdorff spaees and let two 
eontinuous mappings f : (X, Ox) ^ (V', O') 0 (Y", O"), g : (Y', O') 0 (Y", O") (Z, Oz) 

have eompaet eontinuations 

f-.{X,Oy)-^[{Y',O')0{Y",O")]^, 

g:{Y',O')^,0{Y",O")^.-^iZ,Oz). 

Then g o f =: h has a eompaet eontinuation 

h-.{X,Oy)^{Z,Oz), 


if g fulfills 


§«g")-3(g'.rW') 


(2,14) 


for all y' e Yfi, and y" e Yf^,,. If, in addition, c |oo} then h ean he ehosen sueh 

that it fulfills 


h[X\X] C g[{r^'} X y"] U g[Y' X {»"}] U »")}]• (2.15) 


Proof Setting {Y,Oy) ■■= {¥',0') 0 iY",0"), (U, Oy) := [(Y',0') 0 (Y",0")]oo and 
(Y, Oy) ■■= (Y', O')oo' <0 (¥", 0")cx)" we get the theorem as special case of Theorem 2.6.2, 
since all assumptions of its condition iii) and its additional condition hold true, in virtue 
of Lemma 2.6.4 and the condition (2.14). □ 


2.7 Application of the theory to an example 

We agree —oo + (+oo) = +oo + (—oo) = —oo in the following example. Although the 
assumptions in this example prevent the occurrence of the value —oo we nevertheless need 
the stated agreement in order to obtain a continuous addition on [—oo, +oo], cf. Theorem 
2.5.20. 
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Lemma 2.7.1. Assume that the following mappings are given: 

i) Two matrices / linear mappings H : R” —>■ R'^, X : R” —>■ R® with 

U{H)nU{K) = {0}. 

ii) Two proper, lower semicontinuous and coercive mappings (f) : R*^ —>■ [—oo, +oo] and 
-i/; : R® —> [— 00 , +oo]. 

Then the mapping h : R” —>■ [—oo, +oo], given by 

X I—> (j)(Hx) + 'ijj{Kx) (2-16) 

is lower semicontinuous and coercive. In particular, the mapping h attains its infimum 
inf h E [— 00 , +oo] at some point in MT. 


Proof. Due to part ii) in Theorem 2.5.16) our task of proving that h is coercive and lower 
semicontinuous can be done by showing that setting h{co) ■= +oo gives a continuous 
continuation h : (R”,(T®"') ^ ([— oo,+oo], T) of h. We will do this in three steps: Firstly 
we write h as composition h = go f of easier functions g and / and extend them to compact 
continuations g and /. Secondly, a compact continuation h of h is obtained from g and / 
by applying Theorem 2.6.5. Thirdly we convince us that h = h. 

The mapping h can be written as composition h = g 2 0 giof of mappings 

=■9 


/ ; R« ^ R'^ X R^ 

: R'^ X R'^ ^ 

92 : 

which are given by 



92 ■ 


[-oo,+oo] X [-oo,+oo], 
[—00, +00] X [—00, +oo] —> 


\'^iy2)J 



Cil + CI2 • 


[- 00 , + 00 ] 


After equipping the vector spaces R” =: X and R*^ x R® =: T with their natural topology, 
the interval [— 00 , + 00 ] =: Z with the Topology T, and [— 00 , + 00 ] x [— 00 , + 00 ] with the 
corresponding product topology T0T we have continuous mappings /, gi,g 2 and g = g 2 °gi- 
Due to J\f{K) n A/{L) = {0} the mapping / is normcoercive and hence the mapping 


f-.{^l,oT) 


fix) ■■= 


ifix) 


{{W^ X R^)oo,(0®‘'®0®")oo) 
if x e R”- 


00 


if X = 00 
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is a compact continuation by Theorem 2.5.18; furthermore / fulhlls clearly /[M”\M"'] c 
{oo} by definition. Similar, by part ii) in Theorem 2.5.16, we obtain compact continuations 
$ ■ ([-oo,+oo],r) and ^ ([-oo,+oo], T) of ^ and by 

setting 0(oo) := +oo and ■^(oo) := +oo, respectively. These two mappings form a compact 
continuation gl : x ® O®®) —> ([— oo,+oo]^, T®^) of gi. Then 

^ : (M^ X M^, Of ® Of) - ([-», +»], T) 

g-= g2° 91 = '^ + ^ 

is a compact continuation of g. In order to apply Theorem 2.6.5 we note that {Y\ O') ■= 
(M'^, O®'^) and (F", O") := (R®, O®®) are surely locally compact Hausdorff spaces, and that 
the mappings / : (RS^,0®”) - [(F', O') ® (F",0")]^, g : (F',0')oo ®^(F",0")oo - 
([— 00 , +oo],T) have the needed form, where g fulhlls g{(^,y") = = +oo = 

^iy') + "^(go) = g{y\ oo) for all y' e and y" e F^, because (j) and are proper. Applying 
the theorem we obtain a compact continuation 

h : (R'^ X R®)oo —>■ [— 00 , +oo] 


of h with 

h[{oo}] = h[Rf R”] c ^[{oo} X F"] u g[Y' x {oo}] u ^[{(oo, oo)}] = {+oo}, 

i.e. h(oo) = +00 = h(oo). So h = h is indeed a continuous mapping ((R‘^xR^)oo, of ^ 
([-oo,+oo],r). □ 


59 





CHAPTER 


Coercivity of a sum of functions 


Outline 

3.1 Extension of coercivity notions to broader classes of functions. 61 

3.2 Normcoercive linear mappings . 65 

3.3 Semidirect sums and coercivity. 67 


In this chapter we develop a tool (Theorem 3.3.6) which gives information on which snb- 
spaces a sum F + G oi certain functions is coercive. The coercivity assertion of Lemma 
2.7.1 is contained as special case in the coercivity assertion of Theorem 3.3.6 if we set 
F = (j){Hx) = Fi w 0^2 and G = ^Ij^Kx) = Gi w Oyj with Fi ■■= F\xi and Gi ■= G\y^, 
where := X 2 := Af{H) and ■■= n{K*), Y 2 := AfiK), see Detail 8 in the 

Appendix. 

In contrast to the previous chapter we restrict us in this chapter to coercivity notions with¬ 
out regarding e.g. lower semicontinuity at the same time. Moreover the coercivity notions 
in this chapter are rather based on norms instead of compact (or compact and closed) 
sets. In case of vector spaces of hnite dimension there is however a strong relation between 
topological coercivity notions from the previous chapter and the coercivity notions that 
will be given in this chapter, see Lemma 3.1.6 and cf. Theorem 2.5.16. For linear mappings 
between vector spaces of hnite dimension normcoercivity is equivalent to injectivity, see 
Theorem 3.2.1. 


3.1 Extension of coercivity notions to broader classes of 
functions 

So far we introduced the notions of coercivity and normcoercivity only for mappings 
/ : R” —>■ R™', cf. the dehnitions on page 45. We now extend the notion of coerciv¬ 
ity and normcoercivity to broader classes of functions, show that they behave well under 
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concatenation and that the “Cartesian product” of normcoercive mappings is again norm- 
coercive, see Theorem 3.1.3 and Lemma 3.1.4, respectively. Then another extension of 
the notion of coercivity is performed by replacing the codomain [—oo, -l-oo] by a general 
totally ordered set (Z, <). In Lemma 3.1.6 we will see that this coercivity notion is re¬ 
lated to topological coercivity notions involving the spaces (Z, 7^) and (Z, 7^). Finally 
a variant of Proposition 2.4.23 is given in Theorem 3.1.7, saying that a coercive mapping 
^ II • lU) ^ (Z, from normed space of hnite dimension into a totally ordered set is 
already bounded below if it is locally bounded below. 

Definition 3.1.1. Let {X, I’lljf) be a normed space with nonempty subset X X . We call 

a mapping f : X —>■ [— oo, -l-oo] coercive, if and only if 

lim / ( 7 ) = -l-oo. 

||:r||^^ + 0O 

Definition 3.1.2. Let {X, and {Y, ll-Hy) be normed spaces with nonempty subsets 
X X, Y Y . We call a mapping f : X ^ Y normcoercive, if and only if 

lim 11 / (x)||y = -l-oo. 

||x||j5^^ + 00 


(I.e. X ||/(^)||y is coercive.) 

Note in theses dehnitions that functions / are vacuously coercive respectively normcoercive, 
if the domain of dehnition X is bounded; The - more explicitly formulated - dehning 
conditions for coercivity and normcoercivity 

For all sequences in X with —>■ -l-oo we have —> -l-oo. 

For all sequences in X with —>■ -l-oo we have ||/(/^^^)|| —>■ -l-oo 

are namely both trivially fulhlled in that case since a bounded set X contains no sequences 
{x^^i)ken with —>■ - 1-00 as k ^ -l-oo. The following tool is obtained directly from the 

dehnitions. 

Theorem 3.1.3. The following concatenation statements hold: 

i) The concatenation of normcoercive mappings is again normcoercive. 

ii) The concatenation of a normcoercive mapping E : X Y with a coercive mapping 
F : Y [“00, +oo] is coercive. 

In the following lemma we equip the product spaces of 77 x T with the norm || • || := 
II • llxxv •= II • ||x + II • ||y (or any equivalent norm). 
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Lemma 3.1.4. Let (X, || • ||x), (X, || • ||y) and {Z, || • \z), (W, || ■ ||w^) be normed spaees and 
let F : X ^ Z, G : Y W be normeoereive mappings, defined on subsets X and Y of X 
and Y, respeetively. Then the funetion A : X x Y ^ Z x W, given by 


A{x, y) := 



is also normeoereive. 

Proof. In order to prove that A ■. X xY ^ Z xW is normeoereive eonsider an arbitrary 
sequenee (x„, yAnm in X x y with 

IK^^n, ?/n)||xxY —^ +00 (3.1) 

as n —>■ + 00 . We have to show that for any C* e M there is an X e N sueh that for 
all natural n ^ N the inequality ||y4(x„, y„)|| ^ C holds true. Assume that the latter 
statement is not true; then there is a C* > 0 and a subsequenee {xnkXnAkm sueh that 
C > ||A(xnfc,^nJ||zxw = \\F{xnA\\z + ||G(ynJ||w for all A: e N. In partieular we had 

\\F{xnfi)\\z<C and \\G{ynfi)\\w < C (3.2) 

for all k e N. Consequently both (l^n*. ||)fcsN and (l^nfclDfceN would be bounded above by 
some B > 0, see Detail 9 in the Appendix. We thus would obtain \\{xn^,ynA\\xxY = 
ll^nfcllx + ll^nfclly ^ 25 for all /c e N and henee a eontradietion to (3.1). □ 

Definition 3.1.5. Let {X, || • ||) be a normed space and let {Z, <) be a totally ordered set. 
A mapping f : (A, || • ||) —>■ {Z, <) is called coercive iff for any z which is not a maximum 
of {Z, <) there is an R > 0 such that f{x) > z for all x e X with ||a:|| > R, i.e - more 
formally expressed - iff 

'iz e Z\MAX^(Z) > 0 ^xe X :\\x\\> R f{x) > z 


holds true. 

Note in the following lemma that we really mean “73>” in the second condition and that it 
is not a typo. 

Lemma 3.1.6. Let (A, || • ||) be a real normed space of finite dimension and let [Z, ^) be a 
totally ordered set. Eguip X with the topology O which is induced by || • ||. For a mapping 
f : X Z the following are eguivalent: 

i) / : (A, II • II) —» {Z, <) is coercive. 

ii) f:{X,0)-^{Z,n) is topological strongly coercive towards MAX<c(Z). 
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If (Z,^) contains a minimum and a maximum the above conditions are also equivalent to 

iii) is topological coercive towards MAX^(Z). 

Proof. If Z contains less than two elements all three statements are clearly all true and 
hence equivalent. In the following we may hence assume that Z contains at least two 
elements. 

“i) ii)”; Let / : (X, || • ||) — » (^, <) be coercive and let K' e /Cmax^(z)(^, 7^)- 

There is some b e Z’\MAX^(Z) with K' c b], see Detail 10 in the Appendix. Since 
/ : (X, II • II) —» (Z, <) is coercive there is for that b e Z\MAX^(Z) an i? > 0 such that 
f{x) > b for all X e X with ||x|| > R. In other words 

/[X\l^[|| . ||]] c Z\b]. 

Setting K ■= IB_r[|| ■ ||] we hence have found a compact and closed subset of (X, || • ||) with 
/[X\X] c X\b] c X\Kf 

“ii) i)”; Let / : (X, O) {Z, ^) be topological strongly coercive towards MAX<c(2'). 
Let 2 : e Z\MAX<c(2'). The set z] —■ K' is a compact subset of {Z,7'^), cf. Detail 4 
in the Appendix. Moreover K' n MAX^(Z) = 0 so that K' e Since 

/ : (X,0) {Z,T;^) is topological strongly coercive towards MAX^(Z) there is hence a 

K e X{K, O) such that 


/[X\X] c Z\K'. 

Let i? > 0 be so large that Bij[|| ■ \\] ^ K. Then 

/[X\B^[|| .11]]^ f[X\K] c Z\K' = Z\z] = {z. 

In other words we know that for x e X the inequality ||x|| > R implies /(x) > 2 . So 
/ : (X, O) {Z, <) is coercive. Finally assume now additionally that {Z, <) contains 
both a minimum F and a maximum £ Set S' ■= MAX^(Z) = {z}. We have to prove that 
the two statements 

VX' e KsfZ, %) 3K e XA{X, O) : f[X\K] c Z\K', (3.3) 

VL' e XAsfZ, %:) 3K e XA{X, O) : f[X\K] c Z\L' (3.4) 

are now equivalent. In order to prove that (3.4) implies (3.3) it is clearly sufficient to 
show that for any K' e XsfZ,!'^) there is some L' e XAs'{Z,T^) with Z\L' c Z\K', 
i.e. with L' ^ K'. For the inverse implication it is likewise sufficient to show that for any 
L' e XAs'{Z,T^) there is some K' e Xs'{Z,T^) with K' ^ L'. Let hrst K' e Xs'{Z,T^). 
As we have seen in part “i) ii)” there is some b e Z\S' with K' c b]. Clearly 

L' ■■= b] = Z\{b is a closed subset of {Z, T<f). Moreover L' = [z, b] is surely a compact subset 

of {Z, 7 ^); cf. Detail 4 with reversed order or note that z e L' ca be covered by no open 

set from 7 ^, except for the whole space Z ^ L'. So L' = b] fulhlls both L' e XAsfZ, 7 ^) 


64 




3.2 Normcoercive linear mappings 


and L' ^ K'. Let to the contrary L' e }CAs'{Z,%z). Then Z\L' is an open neighborhood 
of S' and contains hence a set of the form (a with some a e Z\S' = Z\{z}. Building 
complements transforms (a c Z\U into L' c a] —■ K' . Again K' is a compact subset of 
{Z,T;^), cf. Detail 4 in the Appendix. Moreover K' does not hit 5" so that it fulhlls both 
A' e ICsiZ, %) and K' ^ A. □ 

Theorem 3.1.7. Let {X, || • ||x) be a normed space of finite dimension and {Z, <) a totally 
ordered set. A coercive mapping F : (A, || • ||x) —^ *■5 already bounded below if it is 

locally bounded below. 

Proof. If dim A = 0, the image T[A] = F[{0}] consists of just one single point, so that F is 
bounded below by that value. If n ■= dim A e N we may without loss of generality assume 
that (A, II • ||x) = (K"', II • II) with some norm || • || on M"". After equipping the totally ordered 
space {Z, <) with the left order topology %> the coercivity of the mapping F : (M”, —>■ 

(Z, <) corresponds to the topological strong coercivity of F : (M”, —> {Z,%>) towards 

MAX<j(Z) by Lemma 3.1.6. Hence Proposition 2.4.23 ensures that the locally bounded 
below mapping F : (M"', C)®"-) —> (Z, <) is even bounded below. □ 

3.2 Normcoercive linear mappings 

A linear mapping dehned in any hnite dimensional space is normcoercive if and only if it 
is injective: 

Theorem 3.2.1. A linear mapping a : X Y of a finite-dimensional normed space 
II ■ lU) into a normed space (Y, || • ||y) is normcoercive if and only if its nullspace M 
just consists ofOx- 

Proof. In the case A = {0} we clearly have J\f = {0}; moreover there is no sequence {xn)nen 
with ||xn|| — >■ + 00 , as n — >■ +oo, so that / is trivially normcoercive. Consider now the case 
A ^ {0}. If A/” contains an element x A Ox then a is not normcoercive since the sequence 
(^«)nsN dehned by := nx fulhlls ||xn||v: —^ +oo but \\a{xn) ||y = n||a:(x) ||v = 0 +oo 
for n —>■ + 00 . To show the other direction we assume that A/” = {Ox}- Then the sphere 
S := {x e A : ||x||x = 1} is niapped by a to a set which omits Oy. For this and by the 
compactness of the nonempty set § we hnd a point x e S with 

min \\a (x) ||y = \\a (x) ||y > 0. 

feS 

By scaling with a positive number A > 0 we see that 

min ||Q:(x)||y = 


mm a ixl 

a;GA§ ' 


A 


( X 
— 

X 

A min \a (x) ||y 

£g§ 


\\Y 


A||q: (x) 


A- 
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This means that ||q; (x) ||y ^ 


X\\x 



>0 


+00 for ||a;||x 


+ 00 , i.e. a is normcoercive. 

□ 


Corollary 3.2.2. Let ^ be linear mappings of a finite-dimensional normed 

space {X, II • ||x) into normed spaces {Y\ || • ||y/), (F", || • ||y«). If their nullspaces Af', Af" 
have only Ox in common then the linear mapping a : X ^ Y’ x Y", given by 



is normcoercive. 


Proof. Since the nullspace Af of a fulhlls Af = Af nAf" = { o^'„ ^ } ''^6 obtain the statement 

= 0y 

by applying Theorem 3.2.1. □ 

Definition 3.2.3. Let X = Xi Q X 2 be a direct decomposition of a real vector space X. 
The linear mapping 'nxi,X 2 '■ X ffi; given by 

'XXi,X2{x) = 7 ^Xi,X2{xi + X 2 ) ■= Xi 

is called projection to Xi along X 2 . If X is equipped with some inner product (•, •) such 
that X 2 = Xfi we will also shortly write nxi ■ 

Lemma 3.2.4. Let X = Xi@ X 2 and X = Wi 0 W 2 be direct decompositions of a real 
vector space X. The following holds true: 


i) The nullspace of ttxi,X 2 Af{xxi,X 2 ) = -^ 2 - In particular, for any subspace Xi of X 
which is also complementary to X 2 , the restriction 'Xxi,X 2 \x^ : Xi ^ Xi is a vector 
space isomorphism between Xi and Xi. 


ii) The linear mapping a : X Xi x Wi, given by 

Mz) 

\XWiW2Y)J 

has nullspace X 2 n IT 2 / in particular restricting a to any complementary subspace Z\ 
of X 2 n 'W 2 yields an injective mapping a\zi '. Zi ^ Xi x Wi. 


iii) //<•+> is some inner product on X such that X 2 = Xfi and W 2 = Wfi then the linear 
mapping a : X Xi x Wi, given by 


a{z) ■= 


f 7^X1 (^)\ 


has nullspace Xfi n Wfi. In particular the restriction a\xi+Wi ■ Hi + hTi Xi x Wi 
is injective. 
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Proof, i) Writing an arbitrarily chosen x e X in the form x = Xi + X2 with uniquely 
determined xi e Xi and X2 e X2 we obtain 

X e M{7rxi,X2) '^Xi,X 2 {xi + X 2 ) = 0 xi = 0 x = X 2 x e X 2 

so that A/”( 7 rxi,X 2 ) = ^2- This implies that the restricted mapping 7 rxi,x 2 ljq : -^1 ^ -^1 is 
injective for any subspace Xi, which is also complementary to X2, since 

■A/'(7^Xl,X2lx^) = f^{'^Xi,X 2 ) n Xi = X 2 n Xi = {0}. 

Due to 

'^^iX2lxi["^i] ^ '^Vi,X2[-^i] = 7 J'Xi,X2[-^i ©-^ 2 ] = 7rxi,X2[-^] = 

the linear mapping xxi,X2\x-^ is also surjective and hence a vector space isomorphism, 
ii) Applying the just proven part twice we obtain for any x e X the equivalences 

a{x) = 0 7^X1,X2{^) = 0 A 'KWi_,W2i.^) = 0 a; e X2 A X e IX2 x e X2 n IX2, 

so that A/'(q:) = X2 n IX2. Likewise as in the already proven part i) this implies that the 
restricted mapping q:|^j : Zi Xi x Wi is injective for any subspace Zi of X which is 
complementary to X2 n 1X2. 

hi) By the just proven previous part ii) we have M{a) = X2 n 1X2 = X^ n W^. Therefore 
and since {Xf n Wf) n (Xi + Wi) = {0}, see Detail 11 , we obtain 

J\f{a\x,+w^) = M{a) n (W + Wi) = (X,^ n W^^) n (W + Wi) = { 0 }. 

Hence a\xi+Wi is injective. □ 

3.3 Semidirect sums and coercivity 

In this subsection we consider functions F,G : R” ^ R u {+00}, which allow a certain 
decomposition into coercive and locally bounded from below parts Fi : Xi ^Ru {+00}, 
Gi : W - R u {+00} and bounded from below parts F2 : X2 —> R u {+00}, G2 ’. Y2 —* 
R u {+00} and prove a sufficient criteria for F + G beeing coercive on a subspace Zi. The 
exact result is stated in Theorem 3 . 3 . 6 . 

The mentioned decomposability of F means more precisely that F can be written as some, 
to be introduced, semidirect sum F = Fi kti F2. The demanded boundedness assumptions 
for F2 and G2 allows us to replace F2 and G2 by the constant zero functions 0^2 and Oyj. 
Working with the simpler direct decompositions Fi w 0^2 and Gi w Oyj is the core of the 
proofs in this subsection. 

Definition 3.3.1. Let X = Xi ® X 2 be a direct decomposition of a real vector space X . 
The semi-direct sum of functions Fi : Xi —> R u {+00} and F2 : X2 ^ R u {+00} is the 
function Fi w F2 : X —> R u {+00}, given by 

(Fi W F2 )(xi + X2) := Fi(xi) + F2 (x2) 
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Remark 3 . 3 . 2 . Although the notation Xi 0 X2 for the underlying spaces suggests the 
similar notation Fi 0 F2 for a pair of functions defined on Xi and X2, respectively, we 
prefer the notation Fi w F2 for the following reason: // Fi : Xi —> M u {+00} and F2 : 
X2 ^Mu{+oo} are mappings with Fi w F2 = Fi w F2 we can in general not conclude 
that Fi = Fi and F2 = F2; for real-valued functions we can conclude only that there is a 
constant (7 e M such that Fi = Fi + (7 and F2 = F2 ~ C, see Detail 12 - moreover not even 
the latter is in general true, if one of the four functions takes the value +00, see Detail 13 . 
But at least we have 


Fi w F 2 = Fi w F 2 Fi = Fi, (3.5) 

if F 2 is real-valued; note here that Fi and Fi have the same domain of definition! 

Lemma 3.3.3. Let M” = Xi 0 X 2 = 17 017 decompositions of ML into subspaces and 
let Fi : Xi R u {+ 00 }, (7i : 17 —>■ R u {+ 00 } be mappings. The following holds true: 

i) For every subspace Xi of R” which is also complementary to X 2 there is exactly one 
mapping Fi :Xi ^Ru{+oo} with 

Fi w 0x2 = 0x2, 

namely the function Fi = Fi o7rxi,x2lx^ = (-^1 0 x 2 )lx^- particular Fi is coercive 

iff Fi is coercive. 

ii) For any subspace Z\ of R” which is complementary to X 2 n 17 == Z 2 we have 

H •■= (Fi w 0 x 2 ) + (G*! w 0 ^ 2 ) = Ri w 0x20^2, 

where Hi := H\z^ = Fi o 7rxi,x2Ui + Gi o t^Yi,Y 2 \zi- If Xi -L X 2 and Yi _L Y 2 holds 
true in addition we can choose Zi = Xi + Yi. 

Proof, i) We first show the uniqueness of Fi. To this end let $1 : Xi ^ R u {+00} be 
a mapping with $1 w 0x2 = -^i®0x2- Clearly the mapping 0x2 is real-valued so that 
we get $1 = Fi by ( 3 . 5 ). Next we show that Fi = Fi o 7rxi,x2ljy^ fulhlls the claimed 
equality Fi w 0x2 = Fi'i) Oxj- To this end we write an arbitrarily chosen a; e R" in the 
forms X = xi -\- X 2 = xi -\- x '2 with xi e Xi,xi e Xi and X 2 ,x '2 e X 2 . Then 7rxi,x2(2?i) = 
7 J'Xi,X2(2^i + ( 2^2 — x' 2 )) = Xi, so that Fi(2:i) = Fi(7rxi,x2(Ti)) = Fi(a;i). Therefrom we 
obtain 

(Fi w 0x2)(a:) = (Fi w 0 x2)(2^i + X 2 ) = Ffxi) + 0 = Fi(Ti) -h 0 = (Fi w 0x2)(Ti x' 2 ) 

= {Fiw0x2){x) 
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as well as Fi o tixi,X2\x.^ = (-^i ® 0^2)IXi since 

Fi o 7 rxi,X 2 lxi(^i) = -^1(3^1) = Fi{xi) + 0 x 2 (x 2 - = (-Fi w ^X 2 ){xi + X2- x'^) 

= (Fi w 0 x 2 )|^^(Fi). 

It remains to show that Fi is coercive iff Fi = Fi o 'Kxi,X 2 \x^ is coercive. To this end note 
that 

7 ’’ •= '^Xi,X 2 \xi '■ 

is a vector space isomorphism by part i) of Lemma 3 . 2 . 4 . Since the spaces Xi and Xi are 
of finite dimension the mapping tt is even a bicontinuous vector space isomorphism. In 
particular the equivalence 

Ilf'/-) II ^ +00 « k(x/’^/|| - +00 

holds true for all sequences (F/’^/neN in Xi so that 

Fi(Fi) —>■ +00 as ||Fi|| —>■ +00, xi e Xi 
Fi( 7 r(Fi)) — >■ +00 as || 7 r(Fi)|| —>■ +00, xi e Xi 
Fi{xi) — >■ +00 as ||xi|| —>■ +00, xi e Xi. 


ii) We first show that Ffi ■■= F[\zi = Fi o 'Kxi,X2\z-i + Gio TTyj^yjki- Writing an arbitrarily 
chosen z[ e Zi in the forms z'^ = x\ + x'^ = |/i + ?// where x\ e W, e X2 and e hi, ^2 ^ 
F2, we indeed get 

= (-^1 ® ^X 2 ){Fi) + {Gi w ^y 2 ){Fi) = (Fi w 0 x 2 )(Fi + x'2) + {Gi w QY 2 ){y'i + vk) 

= Fi{x'i) + Gi{y[) = Fi(7rxi,X2K + A)) + >2(1/1 + y' 2 )) 

= [Fi O 7rxi,X2 + G*! o 7 I'Yi,V2](^i)- 

In order to prove (Fi w 0^2) + {Gi w Oyj) = Hi w 0^20^2 write an arbitrarily chosen 
a; e M" in the forms a; = xi + 0:2 = 1/1 + 1/2 = + ^^2 where each vector is an element of the 

similar denoted subspace. Using 'Kxi,X2{.^i) = 7 i'Xi,X 2 ( 3 ^i + (2^2 — Z2)) = a;i, 7 ryi,y 2 (^i) = yi 
and the previous calculation we obtain 

Hi w 0x2nV2(^) = Hi{zi) + 0 = Fi(7rxi,x2(u)) + lGi('^yi,y2(u)) = -Fi(xi) + 0 + Gi{yi) + 0 
= (Fi w 0x2)(xi + a; 2 ) + (Gi w 0y2)(?/i + 1 / 2 ) = H{x). 

If Xi 1 X2 and Yi 1 U2 we can choose Zi = Xi+ W since (Xi + = X/ n U/ = X2 n ^2 

so that in particular R” = (Xi + W) © (X2 n U2)- Q 
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Theorem 3.3.4. Let R” = Xi ® X 2 be a direet deeomposition of R'^ into subspaees Xi 
and X 2 and let Fi : Xi —» R u {+00} be eoereive and F 2 : X 2 —> R u {+00} be bounded 
below. Every funetion F : R'^ —>■ R u {+00} with F ^ Eiw F 2 is then eoereive on all those 
subspaees Xi of ML whieh are eomplementary to X 2 , i.e. whieh give a direet deeomposition 
Xi 0 X 2 = R” = Xi 0 X2. 

Proof. Since F 2 is bounded below there is a constant m e R with 


-^2(3^2) ^ m 

for all X 2 ^ X 2 . Due to F ^ Fi w F 2 ^ (Fi w Oxj) + m it suffices to show that Fi w 0^2 : 
R*^—^Ruj+Gojis coercive on every subspace Xi of R” which is complementary to X 2 . 
The latter however follows from part i) of Lemma 3.3.3 after hxing any subspace Xi and 
setting Fi ;= (Fi w 0 x 2 )lx^- ^ 

As word of warning note that, in contrast to part i) in Lemma 3.3.3, the previous theorem 
states no equivalence between the coercivity of Fi and Fi := Fi|j^^ but states only that the 

coercivity of Fi carries over to Fi if the assumptions of the previous theorem are fulhlled. 
If F2 is not constant zero the reverse implication is in general not true as the following 
example shows: 

Example 3.3.5. Consider the direet deeompositions R^ = Xi 0 X 2 = Xi 0 X 2 with the 
one dimensional subspaees Xi := R( 1 , 0 )^, X2 := R( 0 , 1 )^ and Xi := R(l,l)’^. Consider 
the funetions Fi : Xi — >■ R, F2 : X2 —» R and Fi : Xi R given by 

F2{x2) ■■= IIX 2 II 2 , 


Clearly F 2 is bounded below. Moreover Fi : Xi —> R is eoereive sinee = 

^ +°o. However the funetion Fi is elearly not 

eoereive. Note that this does not eontradiet the previous theorem since it is not even pos¬ 
sible to write F = Fi w F2 in the form F = Fi w $2 with a funetion $2 : X2 R u {+00}; 
if that would be possible the funetion $2 would actually be finite and the mapping 

g-.X2-^F{{ClY + X2)-F{{t),t)Y + X2) =?\((1,1)^) -F\((0,0)^) 

would be constant on whole X 2 . That is however clearly not the case; for instance we have 
^7((0,0)^) = F((l,l)^) -F((0,0)^) = 1-0 = 1 and ^((0,3)^) = F((l, 4)^) - F((0, 3)^) = 
16-9 = 7. 

Theorem 3.3.6. LeiR” = X 10 X 2 = L 10 T 2 be direct decompositions o/R” into subspaces 
and let Fi : Xi R u {+00}, Gi : Li ^ R u {+00} be both coercive and locally bounded 


Fi := Oxi, 
Fi := 

=:F 
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below and let F2 : X2 —> M u {+00}, G2 : I2 bounded below. Then the sum 

F + G : M” ^ M u {+00} of functions F ^ Fi kt) F2 and G ^ Gi kt) G2 is coercive on all 
those vector subspaces Zi ofMT with M" = Zi 0 {X2 n Y2). In particular F + G is coercive 
on Xi + Yi, if Xi _L X 2 and Yi _L Y 2 hold additionally true. 

Before proving the theorem we give a remark on two important assumptions. 

Remark 3.3.7. It is important to demand locally boundedness of Fi and Gi, see Example 
3 . 3 . 8 . In case of a non-orthogonal decomposition there is no guarantee that F G is 
coercive on Xi + Yi as Example 3 . 3.9 shows. 

Proof of Theorem 3 . 3 . 6 . Since F2 and G2 are bounded below there is a constant m2 e M 
with 


F 2 { x 2 ) ^ m2, ^2(2/2) ^ m2 

for all X2 e X2, 2/2 e Y2. Hence F G ^ (Fi w Oxj) + (GiwOyj) + 2 m 2 , so that it 
suffices to show that (Fi w 0^2) + (Gi Oyj) =• H is coercive on any subspace Zi which is 
complementary to {X2 n I2) == Z2. Concerning the domains of dehnition Xi.,Yi and Zi 
of the mappings Fi,Gi and Hi ■= i 7 |zi, respectively, we may, without loss of generality, 
assume Xi = Xf, Yi = Yf- and Zi = Zf, respectively, see Detail 15 . In order to prove 
that Hi is coercive let any sequence {zk)ken in Zi = (X2 n Y2)'^ = Xf + Yf- = Xi + Yi with 
ll^^fcll —>■ +00 for k —>■ +00 be given. The claimed Hi{zk) —>■ +00 as k ^ +00 holds trivially 
true, if there is a iC e N such that Hi{zk) = +00 for all k ^ K. If there is no such K we 
may without loss of generality assume Hi{zk) e K for all k e N. Since both Fi and Gi are 
bounded below, see Detail 14 , there is a constant mi e M such that 


Fi{x) ^ mi, Gi{y) ^ mi 

for all X e Xi, y e Yi. Therefore and by part ii) of Lemma 3 . 3.3 we obtain 

Hi{zk) = Fi{7Txi{zk)) + Gi{nYi{zk)) 

^ max {Fi (ttxi (zk )), Gi (zk ))} + mi 
'Fi(7rxi(2:fe))\ 


Gi{7rY,{zk)) J 


A{d{zk)) 


+ mi 


+ mi. 


where 


A{x,y) ■= 


Fi{x) 

Gi{y) 


a(z) ■= 


^Y^(z) 


the mappings A : and d : Dg —> D^, are here dehned on the nonempty sets 


Fx {(^1) ?/i) ^ Xi y. Yi : Fi(a;i), Gi{yi) e M} c Xi x Hi 
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and 

Ds := {^1 sZ,=X, + V,: (7rxA^i),^Vi(^i)f ^ ^ Xi + ^ c R-, 

respectively. The mappings A and a are restrictions of the likewise defined mappings 
A : Xi X » (M u {+oo}) x (R u {+oo}) and a : M” —> Xi x Yi, respectively. Due 
to the previous estimate it suffices to show that A o a : Dg —>■ R^ is normcoercive. Part 
iii) of Lemma 3.2.4 ensures that a\xi+Yi is injective. The normcoercivity of a\xi+Yi is 
hence obtained by Theorem 3.2.1 and carries over to d = oi\d~- In order to prove the 
normcoercivity of A we write its domain of dehnition in the form 

^ {(^ 1 ) Vi) ^ ^ • -^ 1 ( 2 ^ 1 ) ^ Gi{yi) ^ 

= {xi e Xi : Ti(a:i) e R} x [yi e Tj : Gi(?/i) ^ 1^} 

V' V' 

=:Y 


and restrict the coercive and hence normcoercive functions Fi and Gi to Fi\y —■ F and 
Gi\y —■ G, respectively. Applying Lemma 3.1.4 to 




(h-) 

[Gi*) 


gives then the normcoercivity of A. Finally the concatenation A o d of the normcoercive 
mappings is again normcoercive by Theorem 3.1.3. □ 

Example 3.3.8. Consider the functions F, G : R^ ^ R given by 


Fixi,X2) 


Setting 


^ /or xi A 0 
0 for xi = 0 ’ 


Gixi, X2) 


X 2 for X 2 A 0 
0 for X 2 = 0 


Xi :=span(ei), 

Yi ■= span(e2) = X 

X 2 :=span(e2), 

Y2 ■= span(ei) = X 

Fi :=F Xi, 

Gi :=G|y, =GU2, 

F2 :=0x2, 

0 

II 

0 

Jl 


we can write F and G as semidirect sums 


F = Fi^ F2 


G = Gi w G 2 . 


Clearly all assumptions of Theorem 3.3.6 are fulfilled - except for one: The function Fi 
fails to be locally bounded below, because of the exceptional point (0,0) e Xi. Setting 


xGl := 
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gives a sequence {x^^'^)neN with —>■ +oo as n ^ +oo for which 

Fix^'^^) + = f—) — 7-j^ + = —n^ + + ^ —>■ —00 ^ +00 

\n/ (-)^ 

as n — >■ +00. In particular the sum F + G is not coercive on the complementary subspace 

Xi + Fi = M2 ofX2 0^2 = {0}. 

Example 3.3.9. Consider the function H : Mf ^ M., given by H{xi, X 2 ) ■= xf and regard 
it with respect to the decompositions 



=-.Xi =-.X2 =-.Yi =-.Y2 


the first beeing an orthogonal one and the second beeing a non orthogonal one. Clearly H 
is coercive both on Xi and Yi. Moreover H is bounded below on X 2 = Y 2 since it is even 
constant there. Setting 

Fi:=H\x„ Gi-.= H\y„ 

F 2 '■= H\x2 = 0 , G 2 '■= F[\y2 = 0 

we can write the functions F ■= H and G ■= H as semidirect sums 

F = Fikti F2, G = Giw G2. 

In accordance with the previous theorem we see that F+G = 2H is coercive on any subspace 
Zi ofMf with m 2 = Zi 0 {X 2 n Y 2 ). However Xi + Yi = M2 is none of these subspaces and 
F + G = 2H is clearly not coercive on Xi + Yi = M2 ^ span(e 2 ). 
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4.1 Unconstrained perspective versus constrained 
perspective 

This section consists of three subsections. In subsections 4.1.2 and 4.1.3, respectively, 
different possibilities of defining 0 ■ (+oo) and the set argmin T of minimizers of a function 
F : M"" ^ M u {+oo} are dicussed among their pros and cons, respectively. We finially 
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choose the dehnitions 

0 ■ (+oo) ;= 0 

and 

argminF := {i; e R” : F{x) < F{x) for all x e R"'}. 

These definitions are suggested when regarding minimizations problems of the form 

Fi + XF 2 min 

from an “uncounstrained perspecitive”, which we prefer to take instead of the alternative 
“constrained perspecitive”. 

Subsection 4.1.1 serves as introduction to the already discussed Subsections 4.1.2 and 4.1.3, 
giving a summarizing and connecting overview of the main ideas presented there, along 
with our concept to keep the gap between the two different perspectives as closed as possible 
in the following sections. 

We hnally mention that we use quite often quotation marks in this section, usually at places 
where, sometimes hidden, unanswered questions lurk. However these implicit questions can 
be ignored when regarding this section just as motivation for our way of dehning 0 ■ (+ 00 ) 
and argminF. 

4.1.1 A kind of dilemma 

Consider for a possibly empty, hxed subset (7 c R"- those pairs of mappings 

F : R”Ru {+ 00 }, f:C^R, 

which are related in a one to one manner by domF = C and F|domF = /• We will also 
write F = f and / = F to indicate that F and / are related in that manner. Two 
things need to be defined; argminF and 0 ■ (+ 00 ). If we want to take an “unconstrained 
perspective” we should define 

argminF := {i; e R” : F(x) < F{x) for all x e R""}, 0 ■ (+ 00 ) := 0. 

If we prefer to take a “constrained perspective” we should define 

argminF := [x e domF : F(x) < F{x) for all x e domF}, 0 ■ (+ 00 ) := + 00 . 

The decision we have to take will turn out to be in a way a dilemma: On the one hand we 
would like the minimization problems argmin F vs. argmin / and “especially” the minimiza¬ 
tion problems argminF = argmin(<I>-l-AT) vs. argmin / = argmin}/-I-A-/), A e [0, -l-oo), to 
be always equivalent. To this end we should choose the dehnitions htting to the constrained 
perspective. On the other hand we would like to avoid a clash with a dehnition of argmin 
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in a general situation and - even more important - want the equation $ + = <l> to 

hold true. To that end we should, however, choose the dehnitions from the unconstrained 
perspective. 

We are aware that it is unfortunately not uncommon to dehne argmin htting to the con¬ 
strained perspective and 0 ■ (+oo) := 0 htting to the unconstrained perspective. We try 
to avoid this mixture of, in general not equivalent, perspectives at the level of dehni¬ 
tions. Instead we will follow the unconstrained perspective here and pursue the strategy 
of imposing conditions in our theorems that ensure at least a weak form of equivalence 
between the unconstrained and the constrained perspective. For instance conditions like 
dom<I> n domd' =|= 0 in Theorem 4.2.6 ensure F ;= <F -I- AT ^ -foo for A e [0, oo), so 
that the unconstrained and the constrained perspective of the minimization problem are 
equivalent here, at least in the sense of argminF = argmin/; for A e (0, -l-oo) we even have 
equivalence in a stronger sense, since 

(<h -f AT/= l> + AT. 

holds in addition. This is, however, no longer true for A = 0, if dom T $ dom $. It 
is the price we have to pay to ensure $ + OT = T without putting further assumptions 
like dom T ^ dom $. Note that a more general version of this inclusion, was assumed by 
Rockafellar in his chapter on Ordinary Convex Problems and Lagrange multipliers, cf. [19, 
p. 273]. 


The following table gives a summarized overview. Some details can be found in the next 
subsections. 



unconstrained perspective 

constrained perspective 

Dehnition of 0 ■ (+oo) 
Dehnition of argmin F 

0 

{x e R” : 

Vx e R” : F{x) ^ F{x)} 

- 1-00 

{x E domF ; 

Vx e domF : F{x) < F{x)} 

argmin F = argmin / 
argmin{Fi -F = 

argmin{Fi s.t. F 2 ^ r} 

for F ^ - 1-00 

for domFi n lev,-F 2 =|= 0 

always 

always 

(Fi -|- AF2) = Fi -f AF2 
Fi -|- OF2 = Fi 

for A e R\{0} 
always true 

for every A e R 

only true if dom F 2 3 dom Fi 

F Isc ^ AF Isc 

for A e [0, -l-oo) 

in general only for A e (0, -l-oo) 


4.1.2 Definition of 0 • (+oo) 

Let (/) : M. and > R be mappings with domains c R*^ and c R^^ 

respectively, and let $ := / and T ;= ■/ denote their natural continuations to functions 
R""—^-Ruj-l-oo}. In the constrained perspective we want $ -I- AT to be the “exact” twin 
of / -I- Xip for all A e [0, oo), i.e. we want 

($ -1- AT)''= T + AT = 0 -F A/) 
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to hold true. For A e (0, +oo) this equation is always fulhlled. For A = 0 it is however 
in general only true, if we would set 0 ■ (+oo) to be +oo; choosing any other value from 
[0, +oo) for this product, let us say the value 0, would cause the domain of dehnition of 
(<h + 0d')''to be different from the domain of dehnition of <h + 0^, if dom 'h $ dom $; Here 
the domain of dehnition of (<F + Od^)'' = $ equals = dom<h, whereas the domain of 
dehnition of $ + O'h is = dom $ n dom 'h c dom <h. 

In the unconstrained perspective we concede $ + A'F a mode of being that is beyond being 
a copy of (j) + \'ip, made up for technical purposes; Here we consider <h, d' and <h +Ad/ in hrst 
line “really” as mappings R” R u {+oo} which all have the same domain of dehnition. 
This allows us to achieve $ + Od/ = $ by setting 

0 ■ ( + 00 ) := 0 . 

With this dehnition we accept that the identity (<F + Ad/)''= <F + AT = 0 + Xtl) may fail for 
A = 0. 

Finally we remark that our dehnition of 0 ■ (+oo) seems to be the “correct” one from the 
viewpoint of lower semicontinous functions; If d/ : R” —>■ Ru {+oo} is lower semicontinuous 
then so is Ad/ for all A e (0, +oo) and also for A = 0, thanks to our dehnition 0 • (+oo) := 0. 
Note that lower semicontinuity would, however, in general not be preserved, if we had 
chosen 0 ■ (+oo) to be +oo in the constrained perspective’s sense; Consider the function 
; (0, +oo) —>■ R, given by xjj[x) ■= K Its natural continuation xp ■. R” —>■ R u {+oo} is 
lower semicontinous, but its product 0-d/ (in the constrained perspective’s sense!) would not 
be lower semicontinous, since its epigraph would be the non-closed set (0, +oo) x [0, +oo). 

4.1.3 Definition of argmin 

Let / ; (7 —> R be some real-valued function, dehned on some subset (7 c R"- and let 
F := / be its natural continuation to a function R” —>■ R u {+oo}. 

In the constrained perspective we regard F as a kind of working copy of /; in particular 
we want the equation argminF = argmin/ to hold always true. Dehning argminF as 
{x E domF ; F{x) < F{x) for all x e domF} would do the job. 

In the unconstrained perspective we, however, want to minimize F “really” over R”, its 
whole domain of dehnition, so that we dehne 

argmin F := {x e R” ; F(x) ^ F(x) for all x e R”} 

We then still have argminF = argmin/, except for the particular case F = -l-oo where we 
unfortunately get argmin F = R” =|= 0 = argmin /. 

Despite this small disadvantage we nevertheless dehne argmin F according to the uncon¬ 
strained perspective - not only because we had already decided us for this perspective 
when dehning 0 • (+oo) := 0 but also for the sake of consistency with the dehnition of 
argmin in the following more general situation; Assume we want to dehne argmin H for a 
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quite general function H : X Y between a (possibly empty) set X and a totally ordered 
set {Y, <y). The natural choice for dehning the (possibly empty) set of minimizers seems 
to be 


argmini4 := {x e X : H{x) H{x) for all x e X}. 

Our de facto dehnition of argminF appears then just as a special case for X = M”, 
Y = (—oo,+oo] with the natural order and H = F. In contrast, the rejected, constrained 
perspective way of dehning argminF would clash to the general dehnition for F = +oo. 

We conclude this section with a remark to the constrained optimization problem 

argmin{Fi s.t. F 2 < r} := {x e M” : F 2 (x) < r and Fi(x) < Fi(x) for all x e levT-F 2 }, 

where r e M and Fi, F 2 : M” —>Mu{+oo}. In the constrained perspective we can always 
rewrite it to argmin{Fi + Liev^F 2 }- In fhe unconstrained perspective we can do this however 
only if Fi + iiev^F2 ^ i-G- if fhe overlapping condition domFi n levT-F2 =|= 0 is fulhlled. 
A similar condition which ensures a stronger overlapping between domFi and levT-F 2 is 
used in part i) of Theorem 4.2.6. The question is also if we should at all speak of the 
’constrained problem’ argmin{Fi s.t. F 2 < r}, dehned as above, in the context of our 
unconstrained perspective, or if we should consider just the problem argminFi + L\ev^F 2 
instead. 


4.2 Penalizers and constraints 

This section consists of three subsections: In the hrst subsection we review general relations 
between the constrained problem 

argmin{<I>(x) s.t. T(x) ^ r} (4.1) 

and the unconstrained, penalized problem 

(-P 2 ,a) argmin{<l>(x) + A4 /(x)}, A ^ 0. (4.2) 

xsR" 

This relation is stated in Detail in Theorem 4.2.6. In the second subsection we add to a 
primal problem, which can be the constrained or the penalized problem, the correspond¬ 
ing Fenchel Dual problem along with conditions that characterize their solutions. In the 
third subsection we discuss Theorem 4.2.6. In particular a relation between one of its 
assumptions and Slater’s Constraint Qualihcation is given. 

4.2.1 Relation between solvers of constrained and penalized 
problems 

In this subsection there are two lemmas and one theorem along with their proofs and 
some examples. The hrst Lemma 4.2.1 is an auxiliary lemma for the second Lemma 
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4.2.3. The latter lemma gives a relation between the subgradients and dis{x*), 

where S ■= lev,i,(j;*)\['. This relation is used to prove Theorem 4.2.6, which gives relations 
between solvers of SOL(Pi,-) and S0L(P2 ,a)- For comments on this subsection see Section 

4.2.3. 

Lemma 4.2.1. Let 4/ : ^ R u {+oo} be a proper and convex function, x* e domT 

and S ■■= lev^(a;*)4/. Let p e ML such that the half-space ^ with a ■= (p,x*} contains S. 
Then we have the equality 


inf T(x) = T(a;*), (4.3) 

if X* E int(dom4/) or if both x* e ri(dom4/) and S is not completely contained in Hp^- 

Proof. For n = 0 the assertion of the Lemma is trivially true. Without loss of generality 
we may therefore assume n ^ 1 in the following. We hrst consider the case x* e int(dom'l/). 
Assume that there exists y e Hp^ such that 4/(|/) < 4/(a;*). Since y.,x* e domT, we see by 
the convexity of 4/ that 

^{\y + (1 - \)x*) < AT(?/) + (1 - A)T(a:*) < T(a:*) 

-V-^ 

for all A e (0,1). Since x* e int(dom4/) we have x\ e int(dom4/) for A small enough. Since 
4/ is continuous on int(dom4/), there exists e > 0 such that the Euclidean ball Be(a;A) cen¬ 
tered at x\ with radius e fulhlls ^ int(dom4/) and 4/(a;) < 4/(a;*) for all x e B£(a;A). 

Hence we obtain by the assumption on S and p the inclusion B£(xa) c S' c Hp^ so that 
Hp^a = 0- This contradicts xx e Hp ^. 

The remaining case can be reduced to this argument: Without loss of generality we may 
assume x* to be the point of origin, so that Hpa ^rid aff(dom4/) =: Lf are vector sub¬ 
spaces of R'^; note herein x* e ri(dom4/) c aff(dom4/). For simplicity of perception we 
may without loss of generality assume further, that p is of the form p = (0,..., 0,1), i.e. 
Hp^a = X {0} and = R'^”^ x (—oo,0]. The level set S c is not completely 
contained in Hp ^. Therefore Hp ^, or rather ■= Hp ^ n U, must separate U in an upper 
part •■= Hp^ n U and a lower part 44^ := 44^^ n U ; note here that 44^ is a hyperplane 
in 4f = aff(dom4/) by Detail 16. Due to 44*= ^ S and since 4/(x) = m.ixeH= 4/(x) 

we can consider 4/ only on U = aff(dom4/) and then argue just as before in this vector 
subspace, using x* to be an interior point of S (considered of course as subset of U). □ 


Remark 4.2.2. 

i) In cases where aff(dom4/) is the full space R"", i.e. where int(dom4/) = ri(dom4/), 
the condition x* e int(domT) is, in general, really necessary to get the equality (4.3) 
as Fig. f.l illustrates. 
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Figure 4.1: Illustration that relation (4.3) is in general not valid for x* e dom'I'\int(dom 4*). 


ii) In cases where aff(dom^) c M"-, i.e. where int(domd/) = 0, the condition x* e 
ri(dom\E') in general really needs to he complemented by the condition S $ to 
get the eguality (4.3), see the second part of Remark 4..2.5 or make the following 
gedankenexperiment: Look at Figure f.l and regard the two dimensional effective 
domain o/4/ as Xi-X 2 -plane ofM.^, i.e. extend the there sketched function 4/ : > 

M u {+oo} to a function ^ ^ M u {+oo} by setting 


i){xi,X2,xfj 


'4/{xi,X 2) if X3 = 0 
+00 if xs =1= 0. 


Move now x* and the line to some place in ri(dom 4/)\argmin 4/ but change the 
direction of if necessary, in such a way that we still have S ■= lev^(j;*)\l/ c Hp ^. 
Consider finally the line ^ as part of a plane withp s R^\{0} and d ■= (p, x*). 
As long as we consider only such planes which are not identical to the X 1 -X 2 - 
plane aff(domd'), but intersect this plane only in Hfa’ everything keeps essentially 
the same as before: Also separates doin']/ at x* e ri(dom4/) into two parts, 
such that S is completely contained in Hp^. Such a separation is, however, no longer 
performed by Hp^ if it is identical to the xi-X 2 -plane. In this case eguation (4.3) is 
clearly no longer fulfilled. 


The following lemma will be used in our proof of Theorem 4.2.6. 

Lemma 4.2.3. Let 4/ : M"" ^ M u {+ 00 } be a proper, convex function, x* e domT and 
S ■■= lev^(a,*)4/. Then we have 


dT(x=^) c dLs{x*). 


(4.4) 
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If X* is not a minimizer of ^ we moreover have 

= Mq i/x* e ri(dom']/), (4.5) 

= RQd4/(x*) i/x* e mt(dom'I'), or in other words (4.6) 

if X* e ri(dom4/) and aff(dom4/) = R”. 


A proof of a similar lemma for finite functions : R” —» R based on cone relations 
can be found, e.g., in [12, p. 245]. Here we provide a proof which uses the epigraphical 
projection, also known as inf-projection as dehned in [20, p. 18+, p. 51]. For a function 
/ : R*^ X R™ ^ R u {+oo}, the inf-projection is dehned by u{u) ■= f{x,u). The name 

’epigraphical projection’ is due to the following fact; epii^ is the image of epi/ under the 
projection (x, u, a) t--> (n, a), if argmin^, /(x, u) is attained for each u e domi^. (Note that 
this is not the projection onto epigraphs as used, e.g., in [2, p. 427].) The inf-projection is 
convexity preserving, i.e., if / is convex, then v is also convex, cf. [20, Proposition 2.22]. 

Proof. 1. First we show that R[[ d'^^x*) c dtsix*)- By dehnition of the sub differential 
we obtain 


ged4/(x*) Vx e R” : (g, x — x*) < 4/(x) — 4/(x*), 

=> Vx e S': (g,x —x*)<0 

Hence we obtain the above inclusion by 

p e dLs{x*) Vx e S': (p, x —x*)<0. (4.7) 

2. Next we prove dis{x*) c ^^/(x*) if x* is not a minimizer of 4/ and the additional 
assumptions in (4.6) are fulhlled, so that x* e int(dom4/). Let p e dis{x*). If p is the 
zero vector, then we are done since ^^(x*) =(= 0. In the following we assume that p is not 
the zero vector. It remains to show that there exists h > 0 such that j^p e d4/(x*). We 
can restrict our attention to p = (0,..., 0,p„)"'" with p„ > 0. (Otherwise we can perform a 
suitable rotation of the coordinate system.) Then (4.7) becomes 

p e dLs{x*) Vx = (x, x„) e S' : p„x„ < p„x*. (4.8) 

Hence we can apply lemma 4.2.1 with p = (0,..., 0,pn)"'" and obtain 

inf 'F(x) = 4/(x*). 

{xeR'^:x„=x*} 

Introducing the inf-projection : R ^ R u {+oo} by 

n{xn) ■■= inf T(x,x„). 

S£R"-1 


this can be rewritten as 

HO = ^iO- 


(4.9) 
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Therefore we have 

^p= E d'i/ix*) Vx e R” : T(x) ^ i^(x*) + ^Pn(xn - O 

VXn e R : l^(Xn) ^ + ^Pn(Xn - X*) 

h 

« ^Pn^dv{xl), 

so that it remains to show that dv{x^ contains a positive number. By (4.9) we verify that 
z^(x*) is finite. Moreover, x* e int(dom\l/) implies x* e int(domz/). Therefore dz/(a;*) =|= 0. 
Let Qn E di'{x*), i.e., 

QniXn - X*) < n(Xn) “ Z^«) 

for all Xn e R. Since x* is not a minimizer of T, there exists y e R"^ with T(?/) < and 

we get by (4.8) that yn < a;*. Since pn = x* would by (4.9) imply that ^/(a;*) = i^ipn) ^ 
4 /( 1 /), we even have pn < x*. Thus 

qnivn - K) < Hyn) “ HK) ~ < o 

implies Pn > 0 and we are donu^_ 

3. Next we prove dcs(x*) Q Rq d4/(x*) if x* is not a minimizer of 4/ and x* e ri(dom4/); 
then taking closures in Rq d4/(a;*) c dts(x*) Q Rq d4/(a:*) gives the wanted dis{x*) = 
Rq d4/(a;*) since dis{x*) is closed. 

We have x* e domt 5 = S' c dom4/, so that both effective domains are in particular 
contained in aff(dom4/) =: A. Applying Theorem B.17 two times yields hence 

dts(a:*) = d(t5U)(a;*) + , 

d^(0) = d(TU)(x'^) + 

where U is the difference space of A. By part 2. of the proof we know d(ts'|A)(a;*) = 
Rq d(4/|^)(a;*). So the claimed dis(x*) Q Rq d4/(x*) is equivalent to Rq d(4/|A)(a;*) + P-^- Q 
Rq [d(4/|^)(x*) + f/-*-] and can hence be proved by showing that the relation 

R^B + VL c R+{B + W) 

holds true for any subsets B,W of R” with Rq W = W. To this end let A e Rq , b e B and 
w eW he given. In case A =|= 0 we have \h + w = \{h + \~^w) e Rq (5 + hL) c R+(B + W). 
In case A = 0 we have Xb + w = Ob + w = limfc^oo + kw) e Rq (5 + W). Thus 
Rq B + IT c R+(_B + W) really holds true. □ 

Remark 4.2.4. The condition that x* is not a minimizer o/4/ is essential to have equality 
in (4.4) as the following example illustrates. The function 4/ given by 4/(x) = is minimal 
at X* = 0 E int(dom4/). ITe have S ■■= lev^(o)4/ = {0} so that 

RQd4/(x*) = {0} c R = di5(x*). 
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Remark 4.2.5. i) The condition x* e dom'k is not sufficient to get equality in (4.4). 
Consider the proper, convex, lower semicontinuous function 'k given by 




—if a: ^ 0, 
+00 if a: < 0. 


The point x* = 0 is not a minimizer of fE' and belongs to dom\l/ but not to ri(dom\l/). 
Using S ■■= lev^(o)4' = Mq 

Mgd\l/(a;*) = 0 c (—oo,0] = dis{x*). 

ii) Even the condition x* e ri(dom4') is not sufficient to guarantee equality in (4.4), if 
aff(dom \1/) is not the full space Consider the proper, convex and lower semicontinuous 
function 4/ given by 


4'(xi,a;2) 


Xi if X2 = 0, 

+00 if X2 0. 


The affine hull aff(dom\E') = M x {0} is a proper subset ofMf. We have S ■= lev 5 ,( 2 ;*)\l/ = 
(— 00 , a;*] X {0} for arbitrarily chosen x* = (a;*,0) e M x {0} = aff(dom\l/) = ri(dom\l/). 
Applying Theorem B.17 to aff(dom\l/) —■ A—■ U yields 

dis{x^) = d(/.5U)(x*) + U^= R+(l, 0)^ + M(0, f)^ 

= {{PuP2f ■ Pi e [0, +oo),p2 e (-oo, +oo)} 

and d\l/(a;*) = (1, 0)^ + M(0,1)"^ = {( 1 ,^ 2 )"^ : P 2 ^ (— 00 , +°o)} so that 

R(j'd4'(a;*) = {(0,0)^} u {(^ 1 ,^ 2 )^ : Pi e (0, +co),P 2 e (-go, +go)}. 


We see that the closure Rq 54/ is just the closed half-plane dis{x*), as guaranteed by Lemma 
4.2.3. However we only have Rg 5\E'(a;*) c dis{x*). 

Concerning Lemma f.2.1 we note that equation (4.3) holds true here if and only if S is 
not completely contained in the straight line where a{p) ■= (p,x*}: Choosing any 

P = {PijP 2 ) with Pi > 0 we see that the line H^a(p) intersects aff(dom4/) only in x*, so 
that we clearly have infj,g//=^ 4/(a;) = 4/(x*). However this equation is no longer fulfilled if 
we choose p in such a way that aff(dom4/) c say e.g. p = (0,1). 


Using Lemma 4.2.3 it is not hard to prove the following Theorem 4.2.6 on the correspon¬ 
dence between the constrained problem (Pi,,-) in (1-2) and the penalized problem (P 2 ,a) 
in (1.3). The core part of the theorem has been restated in Corollary 4.2.7 for proper, 
convex functions $,4/ : R'^ ^ R u {+00}, where the effective domain dom4/ is open and 
contains dom $. In that case the theorem basically states, on the one hand, in its second 
part the following: For A > 0 any x e SOL(P 2 ,a) which does not minimize $, belongs also 
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to SOL(Pi^t-) exactly for r = \l/(x). In its first part, on the other hand, it then states 
something converse: For any x e SOL(Pi^t-) which does neither minimize $ nor there 
exists A > 0 such that x e SOL(P 2 ,a)- To determine this A we will later use duality consid¬ 
erations. In the following theorem we give the rigorous statement and take also the case 
A = 0 into account in both parts of the theorem; note here however that the second part 
of the theorem does not state that for given x e SOL(P 2 ,o) there actually is a r e R with 
X e SOL(Pi^t-), cf. Remark 4.2.10. Before proving the theorem we give also one remark to 
part i) and one remark to part ii), noting that, on the one hand, several Lagrange Mul¬ 
tiplier values for A can correspond to the same levelparameter r, and that, on the other 
hand, several levelparameters r can correspond to one and the same Lagrange Multiplier 
value A. 

Theorem 4.2.6. i) Let $,4/ : R” —» R u {+oo} be proper, convex functions. Consider 
(Pi^T-) for are (inf 4/, -l-oo) with ri(dom<I>) n ri(levT-\h) =|= 0 and let x be a minimizer of 
which is situated in int(dom4/). Then there exists a real parameter A ^ 0 such 
that X is also a minimizer of (P 2 ,a)- This parameter A is positive, if x is in addition not a 
minimizer of $. 

ii) For proper $,4/ : R'^ ^ R u {+oo} with dom<l> n dom'F =|= 0, let x be a minimizer of 
(P 2 ,a)- For A = 0 and r e OP(<I>, 4/) the point x is also a minimizer of (Pi,r) if o-nd only 
if T ^ If X > 0, then x is also a minimizer of (Pi,t) for r := 4/(1;) e OP(<I>,4/). 

Moreover, if 4/ are proper, convex functions and x e int(dom4/), this r is unique among 
all values in OP($, 4/) if and only if x is not a minimizer o/<I>. 


This theorem implies directly the following 

Corollary 4.2.7. Let be a proper and convex function with open effective domain and 
let <h be another proper and convex function with dom$ c domd'. For those x e R'^ which 
do neither belong to argmin <F nor to argmin 4/ the following holds true: 

i) If X E SOL(Pi T-) for some r e (inf 4/, -foo) then also x e SOL(P 2 ^a) for some A > 0. 

ii) If X e SOL(P 2 ,a) for some A > 0 then there is exactly one r e OP($, 4/) such that 
X e SOL(Pi_t-), namely r = 4/(r). 


Before proving Theorem 4.2.6 we give the announced remarks. 


Remark 4.2.8. Part i) of the theorem is not constructive. In general, there may exist 
various parameters A corresponding to the same parameter r as the following example with 
m < —2 and r = 1 shows: Consider the proper and convex functions <l>, 4/ : R —^ R given 
by 4/(a:) := |a;| and 


$(r) 


(x — 2)^ if r ^ 1, 

m(x — 1) -t- 1 if r < 1, 
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where m < —2. Note that $ is differentiable for m = —2. Since argmin^gj^ = {2} 
we obtain c ■= rnirijjgargmin<i> = 2. Having a look at the graph of $ and noting that it is 


xgR 


for all T E (0, 2). On the other hand, we get 


argmin + A|a;|} = 

xsR 


we see 

that 


• t. X < 

t} -- 

= 


if 

cT 

m 

{1} 

if 

X E [2, —m). 

[0,1] 

if 

A = —m. 

.{0} 

if 

A e (—m, + 00 ), 


SO that r = 1 corresponds to X e [2, —m]. It is known that the set of Lagrange multipliers 
X is a bounded, closed interval under certain assumptions, see [19, Corollary 29.1.5] 

Remark 4.2.9. Concerning part ii) of the theorem in case that there are different mini- 
mizers of {P 2 ,x), say xi and X 2 , we notice that 4'(xi) =|= 4^(£ 2 ) can appear as the following 
example shows: For $(x) := |x — 2| and 4'(x) -=1x1 and X = 1 we have 

f —2(x — 1) if a; < 0, 

(^ 2 , 1 ) 4>(a;) + \l/(a;) = < 2 if a; e [0,2], 

[ +2(x — 1) if a: > 2, 

i.e., argmin,j,g]g {4>(a:) + \l/(a:)} = [0,2], Hence we can choose, e.g., xi = 1 and X 2 = 2 and 
obtain = 1^2 = \I/(a; 2 ). 

Remark 4.2.10. As warning we finally note that part ii) of the theorem needs to be 
carefully read in case X = 0, since the assertion Vr e OP($, \1/) : x e SOL(Pi^t-) 
r ^ \l/(x) does not state that there actually is a real r with x e SOL(Pi t-). This can be 
concluded if and only if x e dom^'. In our chosen “unconstrained perspective”, however, 
the occurrence of x f dom 'k can indeed happen. Consider for example the proper, convex 
and lower semicontinuous functions 'k :M—>Ru{+oo} given by 

<k(x) := [x — (—1)]^, fl/(x) := 

Clearly dom<k n dom\l/ 0 is fulfilled. For X = 0, we see that x = —\ is the unigue 
minimizer 0/ <k + 0\1/ = <k. Since x f dom'k we have in particular x f SOL(Pi^t-) for all 
reR = OP(d>,^). 


—Vx if X ^ 0, 

+00 if X < 0. 


Proof of Theorem 4.2.6. i) Let x e SOL(Pi^^) n int(dom 4^), where r e (inf \k, + 00 ). Then 
\l/(x) < r holds true. In case 'k(x) < r the continuity of T in int(dom4/) assures 4/(x) < r 
in a neighborhood of x. Consequently x is a local minimizer of $ and hence also a global 
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minimizer of this convex function. In particular x is a solution of SOL(P 2 ,o)- In case 
'l/(x) = r, we get by Fermat’s rule, the regularity assumption, 5\h(x) =|= 0 and Lemma 
4.2.3 the relation 

0 e d($ + tiev.r^)(x) = d<F(x) + diiev^<;,(x) = d<F(x) + Mg d4/(x). 

This means that there exists A ^ 0 such that 0 e d<h(x) + Ad4/(x) c + AT) (x) so that 
by Fermat’s rule x is a minimizer of (P 2 ,\)- If x is not a minimizer of $, then clearly A > 0. 

ii) Let X e SOL(P 2 ,a)- If A = 0 we have to distinguish - at least in our taken unconstrained 
perspective - two cases: In case x ^ domT and any r e OP($, T) c R neither the point 
X is a minimizer of (Pi,r) nor is r ^ +oo = T(x). So the claimed equivalence holds true in 
this case. In case x e domT this equivalence holds also true for any r e OP($, T) : For 
real r < T(x) neither x e SOL(Pi^t-) holds true nor does r ^ T(x). For real r ^ T(x) we 
have X e SOL(P 2 ,o) = argminT, so that also x e SOL(Pi^t-) is fulfilled. 

If A > 0, we have x e dom<h n domT and get x e SOL(Pi^t-) at least for r = T(x) e 
OP($, T) by the following reason; if there would exist x with $(x) < $(x) < +oo and 
'F(x) < r < + 00 , then we can conclude <h(x) + AT(x) < <h(x) + AT(x), since only finite 
values occur. This contradicts x e SOL(P 2 ,a)- Finally, let in addition $, T be convex and 
X e int(domT). If x is a minimizer of $ then r = T(x) is not the only value in OP(<h, T) 
with X e SOL(Pi^t-), since clearly every r ^ T(x) belongs all the more to OP(<h,T) 
while X e SOL(Pi,-) keeps fulfilled. If x is not a minimizer of <F then there can not exist 
another value f =|= T(x) from OP($, T) with x e SOL(Pi .f); For f > T(x) the condition 
X e int(domT) would imply x e argminT, as we already have seen in part i) of the proof, 
whereas for f < T(x) the point x would not even fulfill the constraint condition. □ 

4.2.2 Fenchel duality relation 

Using duality arguments we will specify the relations between (Pi,t) and (P 2 ,a) for a more 
specific class of problems in Section 4.4. In particular, we want to determine A in part 
i) of Theorem 4.2.6. To this end, we need the following known Fenchel duality relation, 
compare, e.g., [20, p. 505]. 

Lemma 4.2.11. Let <F e Fo(M"'), T e Fo(M”^), L e M™’"- and fi > 0. Assume that the 
following conditions are fulfilled. 


i) ri(dom<I>) n ri(dom T(yuL-)) =|= 0, 

ii) P(L) n ri(domT(yU-)) =|= 0, 

iii) ri(dom$*(—L*-)) n ri(domT*(-)) =|= 0, 

iv) TZ{~L*) n ri(dom$*) =|= 0. 
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Then, the primal problem 

(P) argmin {$(a;) + '^{pLx)}, ^ > 0, (4-10) 

has a solution if and only if the dual problem 

{D) argmin{$=^(-L» + } (4.11) 

psR™ \T J 

has a solution. Furthermore x e M" and p e M™' are solutions of the primal and the dual 
problem, respectively, if and only if 

—pedt^(pLx) and —L*ped^(x). (4.12) 

T 

Proof. Assumptions i) and ii) assure that we can apply [19, Theorem 23.8] and [19, The¬ 
orem 23.9]. Using these theorems, Fermat’s Rule and [19, Corollary 23.5.1] we obtain on 
the one hand 

SOL(P) + 0, 

3x e M” such that 0 e d(<F(-)-I-^/(/iL-)) (x) = d<h(x)-1-^L*d\h(/iLx), 

3x e M” e R™' such that pepd^^pLx) and — L*ped<F(x), 

3x e R” 3p e R'" such that /iLx e dT* ^ and x e d<F*(—L*p). 

Due to the assumptions iii) and iv) we similarly obtain 
SOL(D) + 0, 

^3peR™ such that 0 e d(^$0-L*-) + )(t) = -Ld$0-L"j5) + , 

3p e R™ 3x e R"^ such that x e d<F*(—L*p) and fiLx e dT* > 
on the other hand. □ 

4.2.3 Notes to Theorem 4.2.6 and to some technical assumptions 

In this subsection we discuss mainly Theorem 4.2.6 with respect to two aspects; In the fist 
part we deal with the condition x e int(dom4/) and illustrate its importance - at least in 
the “unconstrained perspective” - by two examples. The second part is dedicated to the 
regularity assumptions used in Theorem 4.2.6 and in [5, Theorem 2.4] and their relation 
to Slater’s Constraint Qualihcation. 




4.2 Penalizers and constraints 


The condition x e int(doni^) in Theorem 4.2.6 

Concerning part i) of Theorem 4.2.6 we note that the condition x e int(dom T) is essential 
- at least in onr chosen “unconstrained perspective”; It can not be omitted as the next 
example shows. We will also see that it can not even be replaced by the weaker condition 
X e ri(domT). 

Example 4.2.12. 

i) Consider the proper, convex and lower semicontinuous functions <h : M —» M, 4/ : 
M ^ R u {+oo} given by 

<h(a;) := [x — (—1)]^, == 

ITe have ri(dom<I>)nri(levT-4/) = (r^,+oo) =|= 0 for every t e (—oo,0] = (inf T, sup T]. 
Furthermore argmin{<I> s.t. T < r} = [0] —■ {x,-} does not intersect {—1} = 
argmin$ for all these r. In case r e (—oo,0) we have Xr e int(dom4/) and - 
as guaranteed by part i) of the previous theorem - there is indeed a A ^ 0 with 
Xr e argmin(<l> + AT) i.e. with ^'{0) + AT'(r^) = 0, namely A = —4r(r^ + 1) > 0. 
In case t = Q, however, such a real A ^ 0 does not exist: For X = 0 we have 
Xt- = 0 ^ {~1} = argmin(<l>) = argmin(<l> + OT) - in our unconstrained perspective - 
and for A e (0, +oo) we have 0 f 0 = d(<h + AT)(xt-) so that Xr f argmin($ + AT) 
as well. 

ii) Consider the proper, convex and lower semicontinuous functions $ : R^ ^ R, T : 
R^ —» R u {+oo} given by 

T(xi, X 2 ) := xj + (X 2 - 1)^, T(xi, X 2 ) := 

For any r e (inf T, + 00 ) = R we have ri(dom T) n ri(levT-T) = R^ n [(— 00 , r) x {0}] =|= 
0. Consider 

Xr e argmin{T s.t. T < r} = argmin T(x) = argmin x^ + 1 x {0} 

xg(— oo,r] X {0} a:iG(—oo,r] 

{(cO)'^} /orr<0 
{(0, 0)^} for r ^ 0. 

In case r < 0 there is even a A e (0, + 00 ) with 

(r, 0)^ = Xr ^ argmin{$ + AT} argmin(x^ + Axi) x {0} = {(—I, 0)^}, 

xisR 



Xi if X 2 = 0, 

+00 if X 2 0. 


—^/x i/x ^ 0, 
+00 if X < 0. 
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namely A = —2r > 0. In case r ^ 0, however, there is no X ^ 0 with (0, 0)^ = Xr ^ 
argmin($ + A^'); On the one hand any A > 0 can not do the job, since argmin(<l> + 
A^) = {(—-1,0)^} ^ (0,0)^ for all A e (0,+oo). On the other hand also A = 0 can 
not do the job, since argmin($ + = argmin<l> = {(0,1)^} f (0, 0)^. 

Regularity assumptions and the related Slater Condition 

In part i) of Theorem 4.2.6 the condition 

ri(dom<h) n ri(levT-\l/) =|= 0, 

from [19, Theorem 23.8] was used as regularity assumption to ensure a certain amount of 
overlapping between the sets dom<h and lev^T. In [5] we used a different condition which, 
however, implies our used condition; that condition was: 

“Assume that there exists a point in dom $ n lev,-4/ 
where one of the functions $ or is continuous.”^ 

Another related regularity assumptions is Slater’s Constraint Qualihcation 

3xo E dom<I> : '^{xq) < r. 


We will shortly discuss the relation between this Slater Condition and the hrst condition 
for functions 4/ which additionally have an open effective domain domT. This additional 
assumption has the following effect on part i) of Theorem 4.2.6; All minimizers of are 
now automatically situated in int(dom4/) and for real r > inf 4/ the regularity condition 
ri(dom<l>) n ri(levT-4/) =|= 0 is equivalent to Slater’s Constraint Qualihcation, by the subse¬ 
quent lemma. In this case, the existence of a Lagrange multiplier A ^ 0 is also assured by 
[19, Corollary 28.2.1]^ if we note [19, Theorem 28.1]. 

Dropping this additional assumption again and returning to our general setting in Theo¬ 
rem 4.2.6 we note that it still might be possible to replace the hrst regularity assumption 
by this Slater Condition; however the latter does in general no longer imply the hrst reg¬ 
ularity assumption; The condition T(a;o) < r in itself does not ensure Xq e ri(levT-4/) as 
Fig. 4.2 shows. Imaging that we choose $ now in a way such that dom$ is a closed 
triangle which has xq as one of its vertices and that dom $ intersects the sketched dom 4/ 
only in xq. In particular dom<I> n lev,-4/ = {xq} so that Slater’s Condition is fulhlled 
here, but our hrst regularity condition ri(dom$) n ri(levT-T) =|= 0 does not hold, since 


^ We took that condition from the book of Ekeland - Temam, cf. [13, Proposition 5.6 on p. 26] with 
caution in case Fi = +co and F 2 = — 00 . 

^ after resetting $ to -I-go outside of domt|/ in order to achieve dom$ c dom as demanded by 
Rockafellar on p. 273; his other demand ri(dom<I)) c ri(dom'I') is then automatically fulfilled since dom'k 
is open here. 
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xq ^ ri(lev,-'I') here. However, in situations where xo e int(doni']/) holds true in addition, 
we have xq e int(donid') n lev<T-\h = ri(levT-\h), by Theorem B.9 and we could state the 
Theorem 4.2.6 also with the extended slater condition 

3xo e dom<h : xq e int(dom4/) and ^/(xo) < r 

by the following Lemma: 

Lemma 4.2.13. Let $,4/ : » R u {+oo} be proper and convex functions and let 

int(dom4/) =|= 0. Then, for any r e R, the following statements are equivalent: 

i) r > inf T and ri(dom$) n ri(levT-4/) =|= 0 

ii) r > inf T and there exists an x' e dom<l> n lev^-T where $ or is continuous. 
hi) There is an xo e dom$ with xo e int(dom4/) and 4/(a;o) < r. 

iv) r > inf T and dom<h n int(levT-'l/) =|= 0. 

Proof, iv) ^ hi) ; Let Xq e dom<h n int(levT-T). Then xq e dom<l> holds banally true. Due 
to int(dom4/) =|= 0 we know that domT has full dimension n, so that Theorem B.9 yields 
Xo E int(levT-4/) = ri(levT-4/) = ri(dom\h) n lev<.rd' = int(dom'l/) n lev<T-4/. hi) ^ ii) ; Let 
there exist Xq e dom<l> n int(dom4/) with 4/(xo) < r. This assures directly r > inf T. To 
see the continuity of iiev.^'i' == x', note that the convex function T is continuous in 

Xq e int(dom T), assuring 4/(a:) < r in a whole neighborhood of Xq. ii) ^ i) ; Let or 
be continuous in a point x' e dom$ n leVi-T. Then at least one of the nonempty, convex 
sets A = dom$ or B = lev^T = domiiev.^'i' contains that common point in its interior; 
say x' E int(H) without loss of generality. Choosing any point y' e ri(i?), as permitted by 
Theorem B.8, we have 

:= (1 — A)y' + \x' E ri{B) 

for all A e [0,1), due to Theorem B.7. So we achieve za ^ ri(i?) n int(H) by choosing 
A e [0,1) close enough to 1. In particular ri(H) n ri{B) =|= 0 holds true, i) ^ iv): Let 
Xq e ri(dom<l>) n ri(levT-4/), where r > inf T. Then xq e dom$ holds banally true. Using 
Theorem B.9 we also obtain xq e ri(levT-']/) = int(levT-4/), again due to the fact that IcVt-T 
has the same full dimension n as domT. □ 


4.3 Assisting theory with examples 

This section provides tools which allow to transfer and refine the general relation between 
SOL(Pi^t-) and SOL(P 2 ,a), as stated in Theorem 4.2.6 resp. Corollary 4.2.7, to the more 
special setting in Section 4.4 with homogeneous penalizers and constraints, resulting in our 
main Theorem 4.4.6 of the last Section 4.4. 

Among this current section’s subsections 
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Figure 4.2: Example where 'I'(xo) < t does not imply xq g ri(levT-4'). 


• 4.3.1 Convex functions and their periods space 

• 4.3.2 Operations that preserve essentially smoothness 

• 4.3.3 Operations that preserve decomposability into a innerly strictly convex and a 
constant part 

• 4.3.4 Existence and direction of argmin(F + G) for certain classes of functions 

the last one is the most important one for that transferring; roughly speaking its Theorem 
4.3.21 ensures, for given A > 0, that the value r = 4/(5?) = ||Tx|| is independent from the 
choice of X e SOL(P 2 ,a), if is additionally essentially smooth and (essentially) strictly 
convex on some affine subset A of aff(dom$). Demanding such essentially smoothness and 
(essentially) strictness properties on <h is done in the setting of the next section, so that 
we can apply directly Theorem 4.3.21 for the primal problems in Subsection 4.4.1. 

For the corresponding dual problems we likewise, for given r, would like the value A = 
||p||h= to be independent from the choice of p e SOL(Di^t-). However we can not directly 
apply Theorem 4.3.21 for the dual problems since here the more complicated, concatenated 
function p i—> $*(—L*p) =: $(p) needs to be considered. In Section 4.4 we will see that 
has similar essentially smoothness and strictness properties as $. So the question remains if 
concatenation with a (not necessarily invertible) linear mapping preserve these properties. 
Luckily this is the case if certain conditions hold true, see Theorem 4.3.12 and Theorem 
4.3.16 in the second and third subsection, respectively. 

For the proof of that helpful Theorem 4.3.16 or rather its Lemma 4.3.15 we will use The¬ 
orems and Lemmata developed in Subsection 4.3.1. 

4.3.1 Convex functions and their periods space 

In this subsection we dehne and deal with the periods space of a convex functions. The 
notion of periods space is closely related to semidirect sums discussed in the previous 
chapter; For a convex funtion F : M” —>■ Mu {-l-oo} and any decomposition M"" = Xi@X 2 of 
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its domain of definition into some subspace X 2 ^ P[F] and some complementary subspace 
Xi we can write F in the form F = Fi^ti 0^2 with Pi = P|xi- In subsection 4.3.3 it will 
be convenient to allow Xi to be also an affine subset of M"". To this end we extend the 
dehnition of semidirect sums from Section 3.3 as follows: 

Definition 4.3.1. Let a nonempty subset X c M"- have a direct decomposition X = Xi@X 2 
into subsets Xi,X 2 ^ M"-, The semi-direct sum of functions Pi : Xi —> M u {+ 00 }, 
P 2 : X 2 —> M u {+ 00 } is the function Pi w P 2 , given by 

(Pi w F2){xi + X 2 ) ■= Fi{xi) + F2{x2) 

The next theorem shows that the periods of a convex function form a vector space. This 
space is equal to the constancy space, dehned by Rockafellar, see [19, p. 69]. 

Theorem 4.3.2 (and Definition). Let X be a nonempty affine subset of ML with under¬ 
lying difference space U c M” and let F : X —>■ Mu{+oo} be a convex function. The 
set 


P[F] := {p E U : F{x + p) = F{x) for all x e X] 

= {p E Lf : F{x + p) = F{x) for all x e aff(domP)} 

of all periods of F then forms a vector subspace of U. We will call it periods space of F. 

Proof. The sets are equal; note herein that in case x f aff(domP) the equation P(x + p) = 
F{x) is anyway fulhlled for all p ^ U, since then neither x nor x-\-p belong to aff(domP), so 
that F{x) = +00 = F{x-\-p). Next we prove that P[P] is a subspace of U by the Subspace 

Criterion. Clearly 0 e P[P]. Furthermore P[P] is closed under addition; Let p',p e P[F] 

be arbitrarily chosen. Then F{x' -\-p' -\-p) = F{x' -\-p') = F{x') for all x' e X and therefore 
p' -\- p E P[P]. Finally P[F] is closed under scalar multiplication; Let p e P[F] and x e X 
be arbitrarily chosen. We have to show that F{x Xp) = F{x) for all A e R, i.e. that the 
function / : R —> R u {+00}, given by /(A) := F{x + Xp) is constant. In case / = +00 this 
is clearly true. In case / ^ +00 we choose any Aq e dom/. Since p is a period of P all 
values /(Aq + k), where k e Z, equal /(Aq) < +00. In particular we have Xq -\- k e dom/ 
for k E Z. Part ii) of Lemma B.l, applied to On = Xq — n, bn = Aq and = Aq + n, where 
n e N, now just says that the convex function / is constant on all Intervals [Aq — n, Aq + n], 
where n e N, and hence on whole R. □ 

Lemma 4.3.3. Let X be a nonempty affine subset ofM^ and let E : X M u {+ 00 } be a 
proper and convex function. For any decomposition aff {dom E) = A®P of aS{dom E) —■ A 
into some affine set yl c R'^ and some subspace P of the periods space P[E] the following 
holds true: 

aff(domP|^) = yl, domP = domP|^ 0 P (4.13) 

intq(domP|q) = ri(domP|q), int 24 (domP) = intq(domP|q) © P (4.14) 

Moreover all the sets in these eguations are nonempty. 
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Proof. Since E is proper we have 0 =|= aff(dom E) = A® P so that A 0 and P =|= 0 as 
well. The inclnsion dom P|^0P c dom P holds true since P(a+p) = E{d) = P|^(a) < +oo 
for all d E domP|^ and all p e P c P[E]. The reverse inclusion domP c domP|^ 0 P 
holds also true, since every x e domP c aff(domP) = A® P can be written in the form 
X = d + p with some p e P and d e aff(domP|^), where we even have d e domP|^, because 
+ P) = E{x) < + 00 . Altogether we have 

dom P = dom P|^ 0 P, 

where P ^ +oo guarantees domP =|= 0, so that domP|^ is nonempty, as well. Due to 
the banal domP|^ c A we get the inclusion aff(domP|^) c aff(A) = A, where actually 
equality holds true, since (a slightly transposed) equation (B.ll) in Theorem B.15 gives 
on the one hand 

aff(dom P|q) 0 P = aff(domP|q 0 P) = aff(dom P) = A 0 P 

- whereas the assumption aff(domP|q) c A would, on the other hand, result in the strict 
subset relation aff(domP|q) 0 P c A 0 P, due to P =|= 0. The therewith proven 

aff(domP|q) = A 


gives now directly 


intq(domP|q) = ri(domP|q), 

where these sets are nonempty by Theorem B.8 Using the latter equation and equation 
(B.8) from Theorem B.15 we dually obtain 


intA(domP) = ri(domP) = ri(domP|q 0 P) = ri(dom P|q) 0 P = intq(dom P|q) 0 P, 


where int^(domP) =|= 0 ensures that also intq(domP|q) is non empty. □ 

Theorem 4.3.4. Let F : ML —» Mu{+oo} be a convex function, P a subspace of the periods 
space P[F] and A, A c M" affine sets with A0P = A0P. Then F ■■= P|q : A —> Mu{+oo} 
and F ■= F\^ : A ^ M u {+oo} are the same mapping, except for an affine transformation 
between their domain of definition: There is a bijective affine mapping d : A ^ A with 
F = F o d, namely the mapping given by d{d) = d^d + p) ■= d. 


Proof. Due to A0P = A0P every d e A can be written in the form a = a + 0 = d{d) +p{d) 
with uniquely determined d{d) e A and p{d) e P. Setting Q;(a) := d{d) gives hence a well 
dedned mapping d : A ^ A. Geometrically speaking each a e A is projected parallel to P 
to a point d = d(d) e A. This mapping is bijective, since it is both injective and surjective: 
Let d(ai) = d(a 2 ) for 01,02 e A. Then 01 — 02 = (d(ai) + p(di)) — (d(d 2 ) +^( 02 )) = 
0 + p(di) — ^( 02 ) =■■ p E P, so that 02 + p = Oi + 0. The directness of the sum A 0 P 
gives thus p = 0 , i.e. 02 = oi. This shows that d is injective. In order to prove the 
surjectivity of d let o e A be given. Thanks toA0P = A0Pwe can write d in the form 
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a = a + 0 = CL:):+p^ with some a* e ^ and p:^: ^ P- Rearranging the latter to = a — p^ 
gives a = <5(0=^). It remains to show that a \ A ^ A is affine. To this end let t e M and 
write arbitrarily chosen hi, 62 e ^ in the form 

hi = hi + pi, 02 = ^2 + p2 

with ai,a 2 ^ A and pi,p 2 ^ P- Then their affine combination 

hi + t{a 2 - hi) = hi + t{a 2 - hi) + pi + t{p 2 - P 2 ) 

is of the same form with hi+t(h 2 —hi) e A and pi+t{p 2 —p 2 ) ^ P, so that ci(hi+t(h 2 —hi)) = 
hi + t{a 2 — hi) = ci(hi) + t{a{a 2 ) — ci(hi)) really holds trne. □ 

Remark 4.3.5. Let F : R” —» R u {+ 00 } be a convex function. Every p e P[F] fulfills 
dom F + p = dom F. 

The previons remark gave a necessary condition for p e P[F]. The following lemma gives 
a snfficient condition. It says that, in case of a proper, lower semicontinnons and convex 
fnnction, we do not have to check the condition F{x + p) = F{x) for all x e R” in 
order to prove p e P[F]: It already snffices to hnd only one single a e domF snch that 
F{x + p) = F{x) for all a; e a + span(p). We note that it is even snfficient to hnd one single 
a E domF snch that F is bonnded above on the line a + span(p) by some real a; this is 
ensnred by [19, Corollary 8.6.1], which contains the next lemma as special case. 

Lemma 4.3.6. Assume that a function F : R” —>■ R u {+ 00 } from ro(R”) is constant on 
a line or point a + span(p) c R^ which intersects domF. Then p e P[F]. 

Proof. In case p = 0 the assertion is clearly fnlhlled. In the main case p =|= 0 we have to 
show that F is constant on every straight line x + span(p) parallel, bnt not identical to 
a + span(p). In case of F = +00 we are done. In the remaining case F|a,+span(p) # +00 
we consider F on the affine plane spanned by the non-identical, parallel straight lines 
X + span(p) and a -f span(p), or rather only on the closed strip 

Sx ■= co([a; -I- span(p)] u [a -I- span(p)]) 

bonnded by these lines. We perform onr task in two steps: Firstly we will show that F is 
constant on every straight line y + span(p) in ri(S'a;) = S'a;\([a; -I- span(p)] u [a -I- span(p)]). 
Secondly we carry this knowledge over to the bonnding line x + span(p) of Sx. The whole 
straight line a - 1 - span(p) belongs to dom(F) as well as at least one point x' e x + span(p), 
since F|a;+span(p) ^ + 00 . Using the convexity of domF we hence obtain 

domF = co(domF) ^ co({F} u [a -I- span(p)]) ^ ri(S'a;), 

i.e. F takes only hnite valnes on every straight line y + span(p) c ri(S'j;). Assnme that F 
is not constant on some line y + span(p) c ri(S'a;), i.e. that there were parameters t,t eW 
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with F{y + tp) < F{y + ip). Defining the function Fy^p : R M via Fy^p{t) ■■= F{y + tp) 
this reads Fy^p{i) < Fy^p{t). Since Fy^p is convex the equation (B.2) from Lemma B.l would 
yield 

^{y + [(1 ~ + At]p) = Fy^p{(l — X)i + Xi) ^ Fy^p(t) + X{Fyp(t) — Fy^p(t)) —>■ +00 

as A —>■ + 00 . In particular there would exist ti,t 2 e R such that 

F{y + t2p) > F{y + pp) ^ F{a) = F{a + tp) 

=-y2 =-yi 

for all f e R. So levp’(yp(F) would contain not only the point yi but also the straight line 
a + span(p). The convexity of leVir(yp(F) would therefore give 

levFfei)(T) = co(levF{yp{F)) ^ co({|/i} u [a + span(p)]) 3 n{Sy^) = ri(S'j^) 

with the nonempty closed strip Sy = co([y + span(p)] u [a + span(p)]). The lower- 
semicontinuity of F ensures the closeness of lev^(j^p(F) so that 

leVir(j^p(F) = (levpiypiF)) 3 ri(S'y) ^y + span(p) 9 ?/ 2 , 

yielding F{y2) < F{yi) which contradicts F{y2) > F{yi). So F is constant on every 
straight line y + span(p) in ri(S'a;), i.e. F{y + tp) = F{y) for all f e R. Applying Theorem 
B.4 to a e domF and an arbitrarily chosen x* = x + t*p e x + span(p) we see that 


F(x*) = lim F((l — p)a + = lim F((l — p)a +/ix + pt p). 

Mil yP '■ -V-" "—V—" 

=-2/m 

The point y^ belongs to the relatively open strip ri(S'a;) for all p e (0,1), so that F is 
constant on the straight line y^ + span(p). Therewith and by Theorem B.4 we obtain 

F{yf, + tpp) = F{yp) = F((l - p)a + px) F{x) 

as p 1. Altogether we have F{x*) = F(x) for all point x* e x + span(p). □ 

Remark 4.3.7. Demanding that F is lower semicontinuous is important to ensure p e 
P[F] as the following example shows: Consider the function F : R^ —» R u {+oo} given by 


F(xi,X2) 


+00 for Xi < 0 
•\ X 2 for xi = 0 

^ 0 for xi > 0 


and regard e.g. a = (3,0) and p = (0,4). Then all assumptions are fulfilled, except for 
the lower semicontinuity of F. Moreover the closed right half plane domF clearly fulfills 
dom F = dom F + p; however p f P[F] since F{0 + p) =|= F(0). 
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Theorem 4.3.8. Let i? : X —» Mu {+ 00 } be a eonvex funetion, defined on an affine subset 
X of ML. For any affine subset A X and its differenee spaee U we have 

P[E]nU^P[E\A]. 

We aetually have P^E^ nU = if in addition E e ro[X] and A n domi? =|= 0. 


Before proving this theorem we show by two examples that both the lower semicontinuity 
of E and the condition ^dridomif =|= 0 are essential to get the equality P[E] nU = P[i?|^]. 

Example 4.3.9. 


i) Consider the funetion P:M^—^-Mul+oo} given by 


E{xi,X2,X3) 


X3 ifx3>0, 

< 0 if X 3 = 0 and X 2 

+00 else . 


0 , 


E is obtained from the mapping M^ —>■ M, a; •—> xs by restrieting its effeetive domain 
to the non-elosed set dom E = Lff^ q u (ei). The proper and eonvex funetion E is not 
lower semieontinuous, so that E f ro(M^). Both the X 1 X 2 plane spanjci, 62 } == A and 
its translate Aq + 63 =; Ah are affine subsets of M^ that interseet domP. Although 
they have the same differenee spaee U = A the periods spaees P[P|yi] and P[i?|^'] 
are different; more preeisely 


P[E\a] = P[E] r.U<x P[E\a'] 

holds true: Clearly P[P] r\ U = span(ei) r\ A = span(ei) = P[P|yi]. However 
P[P] Ci U = span(ei) c span(ei,e 2 ) = P[P|a']- 

ii) Consider the funetion P:M^—»Mu{+oo} given by 


E{xi,X2,X3) 


X 3 if X 3 ^ 0 and X 2 = 0, 
+00 else . 


E is obtained from the mapping M^ —> M,a; •—> X 3 by "restrieting" it to the elosed 
half-plane domP = {(a:i, 0, 3 : 3 ) e M^ : a;i e M, a^s < 0}. Defining A, A' and U as above 
we have A n domP = span(ei) =|= 0 but A' n domP = 0 . Clearly P[P] n P = 
span(ei) = P[P|a]- However P[P] r\U = span(ei) c span(ei,e2) = U = P[P|yi/], 
sinee E\a' = + 00 . 


Proof of Theorem f.3.10. Let p e P[P] n U. Then 

E{x + p) = E{x) 
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for all X s X. For a\\ x s A we have x + p s A and hence 

E\a{x + p) = E{x + p) = E{x) = E\a{x), 

for all X e yf c X. This shows P[E] n c Let now the additional assumptions 

be fulhlled and let p e P[i?|yi]. Then p e U. Since E\a # +oo, and E{x + p) = E{x) for 
all X e A we see by part ii) of Lemma B.l that E is in particular constant on any line 
a + span(p), a e A which intersects the nonempty set domi?|^. Lemma 4.3.6 gives thus 
p e P[E] so that p e P[E] n U. This shows that also the reversed inclusion P[P|yi] c 
P[E] n U holds true under the additional assumptions. □ 

4.3.2 Operations that preserve essentially smoothness 

Roughly speaking essential smoothness is preserved when performing the following oper¬ 
ations on an essentially smooth function H : A —> Mu{-l-oo}, dehned on some affine 
subspace A of M"'; 


• Restrictions H\^ to an affine subspace A oi A which intersects ri(domi4) 

• Extensions F of of the form F = H kti Op 

• Forming concatenations F = H o M with a linear mapping whose range intersects 
ri(domL4), 


see Lemma 4.3.10, Lemma 4.3.11 and Theorem 4.3.12. 

Lemma 4.3.10. Let A be an affine subspace ofW^ and P : 4 R u {+oo} be essentially 
smooth. The restriction P|^ of F to an affine set A ^ A stays essentially smooth, if A 
intersects ri(domP)[= intA(domP)]. 


The condition A n ri(domP) =|= 0 is essential to preserve the essential smoothness when 
restricting P to A. Cf. example 4.3.13. 


Proof of Lemma f.3.10. By dehnition of “essentially smooth”, cf. [19, p. 251] and nearby 
explanations, see [19, Lemma 26.2] and cf. [19, p. 213] we have 

a) intA(domP) 0, 

b) F is differentiable in every x e intA(domP) = ri(domP) and 

c) the directional derivative P'(x -I- A(a — x);a — x) —>■ —oo as A \ 0 for every x e 
^^(domP) = rb(domP) and every a e intA(domP) = ri(domP) 
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Set F •■= F\g. Then domF = A n domT, so that equation (B.7) in Theorem B.IO gives 
aff(domF) = A Ct aff(domF) = A n A = A, ensuring intq(domT) = ri(domT) and thus 
dq(domF) = rb(domF). Equation (B.4) of the same theorem gives 

a) intq(domF) = ri(domF) = ii{A n domF) = A Ct ri(domF) =|= 0. 

Due to intq(domF) = A n ri(domF) c ri(domF) = intA(domF) we know that 

b) F = F\^ is differentiable in every x e intq(domF). 

Since equation (B.6) from Theorem B.IO ensures dq(domF) = rb(domF) = rh{A n 
domF) = A Ci rb(domF) c rb(domF) = ^^(domF) we hnally - still - have 

c) F'{x + A(a — x)] a — x) = F'{x + A(a — x)] a — x) —> —oo as A \ 0 for every 
X e dq(dom.F) c ^^(domF) and every a e intq(domF) c int^(domF). 

Therefore F\g = F is essentially smooth. □ 

Lemma 4.3.11. Let F : M” —> M u {+oo} be a convex function and let aff(domF) be 
decomposed as direct sum aff(domF) = A@P of some affine subspace A of ML and some 
vector subspace P of the periods space P[F]. Then the following are equivalent: 

i) F is essentially smooth on A® P = aff(domF). 

ii) F is essentially smooth on A. 

Proof. Assume without loss of generality that A is placed in a way that it even is a vector 
subspace of M” and set A := A0P = aff(domF) = span(domF), / := F\a and / := F|^. 
We are going to show the following; 

intA(dom/) + 0 ^ intq(dom/) 0, 

/ is differentiable in every a e intA(dom/) 

$ 

/ is differentiable in every a e intq(dom/) 

In case that / and / are differentiable in int/i(dom/) and in intq(dom/), respectively, we 
will hnally show 


(4.15) 

(4.16) 


P/laJU^R ^ +GO for all (afc)fc e BS{domf) 

$ (4.17) 

P/laJU^R ^ +00 for all {dk)k e BS{domf) 
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where -BS'(dom/) consists of those convergent sequences in intyi(dom/) whose limit point 
belongs to the relative boundary ^^(dom/). BS{domf) is defined accordingly. 

Note first that dom/ = dom/ 0 P gives by Theorem B.15 the equality A ® P = A = 
aff(dom/) = aff(dom/) ©P. Using A ^ aff(dom/) we hence get A = aff(dom/). By the 
very same theorem we obtain analogously ^^(dom/) = rb(dom/©P) = rb(dom/) ©P = 
dq(dom/)©P and intA(dom/) = ri(dom/©P) = ri(dom/)©P = intq(dom/) ©P. The 
latter equality already shows that (4.15) is true. In order to prove (4.16) we will make use 
of unique decompositions a = d + p and h = h + q of a, heA into d,h s A and p,q e P. 
Assume hrst the differentiability of / in an arbitrarily chosen a e intA(dom/); i.e. that 
there exists a (unique) linear mapping P>f\a : A —>■ R and a function : A ^ R, which is 
both continuous in 0 and fulfills ra(0) = 0, such that 

/(a + h) = f{a) + Vf\a{h) + raih)\\h\\ 

for all sufficiently small h e A. For any d e intq(dom /) we have a = a + 0 e intq(dom /) © 
P = int/i(dom/). So the latter formula stays valid for a = d and all sufficiently small 
h = h E A A. Therefore / = /|q is also differentiable with VflAh) = Vfa{h) for 
all h E A. Assume now to the contrary the differentiability of / in an arbitrarily chosen 
d E int^(dom/); i.e. that there is a (unique) linear mapping P/|a : A —>■ R and a function 
ha : A ^ R, which is both continuous in 0 and fulfills ra(0) = 0, such that 

/(a + h) = /(a) + VflaCh) + fdCh)\\h\\ 

for all sufficiently small h e A. Any a e int^(dom/) = intq(dom/) © P can be written 
uniquely as a = a + p with d e intq(dom/). For /i =|= 0 the translational symmetry of / in 
directions of P therefore gives 

/(a + h) = /(a + h) 

= /(«) + T^flaCh) + fdCh)\\h\\ 

= /(«)+ 'Pllajh) + 

=:La{h + q) = La(h) '' ' 

-■ra{h+q)=ra{h) 


Clearly La : A ^ R is a linear mapping; so we need only to show that extending Ta '■ 
A\{0} —» R via ra(0) := 0 yields a function A —^ R which is continuous in 0. Lemma 
A. 2 says that there is a constant C* > 0 such that -p^H = ^ C- Consequently 

\'fa{h)\ = \raih + g)| < C\fd{h)\ ^ 0 as h —» 0 (i.e. as the components h,q 0 ). Thus / 
is differentiable in a and 

VfUh) = LAh + q) = Vf\Ah). 

We finally proof that (4.17) holds true (under the there stated differentiability assumption). 
For these purpose we will use the found relation between the derivatives of / and /. For 
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any a = a + p e int^(dom/) 0 P = intA(dom/) we have \Vf\a{h)\ = \Vf\a{h)\ for all 
h e A with ||A|| = 1. In particular iP/lallA^K ^ ||’P/|a|U^R on the one hand. Using again 
the inequality ||/i|| < C\\h + p\\ = C\\h\\ from Lemma A.2 we get \Vf\a{h)\ = |P/|a(/i)| < 
||P’/|a||A^Kll^ll < C\\'Df\a\\A^M.\\h\\ for all he A, so that ||'D/|a||A^K < C'||P>/|a||q^jj on the 
other hand. Noting that the constant C does not depend on the choice of a we have in 
total 

||P/|a||A^R ^ ||’P/|a||A^R ^ C'||P/|a|| a^K 

for all a = a + p e A. Therefrom and by using dA(dom/) = dq(dom/) 0 P and 
intA(dom/) = intq(dom/) 0 P we hnally obtain (4.17). □ 

Theorem 4.3.12. Let the convex function P : R u {+oo} be essentially smooth 

on aff(domP) and let M : MP R” be a linear mapping whose range TZ{M) intersects 
ri(domP). Then the concatenation F ■= EoM : R™' —>■ Ru{+oo} is convex and essentially 
smooth on aff(domP). 

Proof. The linearity of M transfers the convexity of E to P. Consider the restricted 
functions P := E\tz(m) and P := F\tz(m*)- Since 1Z{M) n ri(domP) =|= 0 we can apply 
Lemma 4.3.10 to see that P is essentially smooth on 

Ae ■= TZ{M) n aff(domP) = aff(77(M) n domP) = aff(domP), 

where TZ{M) n aff(domP) = aff(77(M) n domP) holds true by Theorem B.IO. The 
equation 

F = EoM, 

where M ■■= elucidates that P and P are the very same mapping - except for the 

bijective linear transformation M : TZ{M*) TZ{M) between their domains of dehnition. 
Hence P is likewise essentially smooth on 

M~^\Ae^ = M”^[aff(domP)] = aff(M“^[domP]) = aff(domP) =: Ae- 

Applying Lemma 4.3.11 to P, Ae '■= aff(domP) = TZ{M*) n aff(domP) and P := A/'(M) 
we hnally see that P is essentially smooth on aff(domP) = aff(domP) 0A/'(M), since 
E\a = P is essentially smooth on aff(domP); note here that the validity of aff(domP) = 
aff(domP 0 A/'(M)) = aff(domP) 0A/'(M) is guaranteed by Theorem B.15. □ 

We give two related examples to illustrate the role of the assumption 77(M) n ri(domP) =|= 
0. Although we start with an example where this assumption is not fulhlled but where 
P o M is never the less again essentially smooth, we will see in the second example that we 
in general can not replace that assumption by the weaker assumption 77(M) n dom P =|= 0. 
We use the notations H and Q for the open upper half plane {tc e R^ : ZC2 > 0} c ^ ^ 
and the first open quadrant { 2 ; e R^ : 2:1 > 0, 22 > 0} c C, respectively. 
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Figure 4.3: Graphs and contour lines of Ea or rather ga- Left for a = j e (0,^) and right for the border 
case a = ^, where ||V(/q.(z *'*^)||2 ^ +oo for all boundary points z = limfe^+oo of dornF^i, except the 
origin (0,0). For better quality of the plot a smaller step size was used near the X-axis and the Y-axis, 
where the norm of the gradient of ga is large. 


Example 4.3.13. Consider first the function Qa :EI-^Ru{+oo} on the closed upper 
half plane H, defined by ga{w) ■= —wf = with some parameter a e (0,+oo). 

Continuing ga by setting 

^ ^ I Qaiu)) = —wf for w e El 

EJw) := ^ 2 ./ 

(+00 forwsR^m 


we obtain a function Eq, : ^ M u {+oo}, which is convex and essentially smooth for 

a E (0,1). Concatenation with the linear projection M : —> R x {0}, M{z) ■■= ( 2 : 1 , 0 ) 

yields the mapping Fa = EaO M ; here Fa{z) = 0 for all e R^ elucidates that Fa is both 
convex and essentially smooth, although TZ{M) does not intersect El = ri(domi?o). 


The essentially smoothness will, however, be no longer preserved by concatenation with M 
if we transform pa’s domain of definition, i.e. the upper closed half plane El c r2 = C, 
to the first closed quadrant Q by means of the bijective mapping h \ Q ^ M., given by 
h{z) ■■= ^z"^ = {\{zl — Z 2 ),ziZ 2 ): The function pa ■= Pa ° h : Q M, where ga{z) = 
— {ziZ 2 )°‘ = —zfzf and a e (0, + 00 ), is infinitely differentiable in Q and continuous in Q. 
Its Hessian 


Ha 


az. 


^ 0^—2 ^ a—2 


(1 - a)z‘^ 

—aziZ2 


-aziZ2 \ 
(1 - a)zl) 


is positive definite for all z e Q, if a e (0, |) by virtue of Sylvester’s criterion. Therefore 
the continuous function Pa is strictly convex in Q and convex in Q for a e (0, |). For 
these a we furthermore have \\Vga{z ^’^^)\\2 —>■ +00 as k ^ +00 for any sequence {z^^^)ke-n 
in Q, converging to some boundary point 2 ;^®^ of Q, see Detail 17. Altogether we see that 
continuing Pa by setting 


Ea{z) 


ga,{z) =-zfzf ifzsQ 
+00 if z E R^\(5 
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leads to a function ^ M u {+ 00 } which is convex and essentially smooth for 

a e (0, |). However Fa = Ea o M = 6[o,+oo)xr is not essentially smooth; here 71{M) 
indeed intersects only dom Ea but not the relative interior of this effective domain, which 
is consistent with Theorem 4.3.12. 

4.3.3 Operations that preserve decomposability into a innerly 
strictly convex and a constant part 

Before giving an overview over the current subsection we need to introduce a manner of 
speaking, in which we use the extension of semidirect sums Ei w E 2 from Dehnition 4.3.1. 

Definition 4.3.14. Let Xi be a nonempty affine subset o/R”. We call a function Ei : 
Xi -> R u {+ 00 } innerly stricly convex iff Ei is strictly convex in ri(domi?i) = 
intafr(dom£;i)(domi?i). Any semi-direct sum E = Ei^ E 2 '. X {+ 00 } of an innerly 

strictly convex function Ei \ Xi Ru{+oo} and some constant function E 2 : X 2 ^ R, de¬ 
fined on some vector subspace X 2 will also be called decomposition of E into an innerly 
strictly convex part Ei and a constant part E 2 . 

Roughly speaking we show in this subsection that the following operations on a proper 
convex and lower semicontinuous function i? : X ^ R u {+ 00 } yield a new function which 
still has a decomposition into an innerly strictly convex part and a constant part; 

• Restrictions E\b to an affine subspace B ^ A =■ aff (dom E) which intersects ri(dom E), 

• Forming concatenations F = E o M with a linear mapping whose range intersects 
ri(domi?), 

see Lemma 4.3.15 and Theorem 4.3.16, respectively. 

Lemma 4.3.15. Let i? : X —> R u {+ 00 } be a proper, convex and lower semicontinuous 
function on some nonempty affine subset X c and let there exist a decomposition 
aff (dom i?) = A@P 0 /aff (dom i?) =: A into a subspace P of P\E^ and an affine subspace 
d c such that E is strictly convex on int^(domi?|^). Then 

i) In fact we even have P = P\E^. 

ii) Any affine subset B A that intersects ri(dom E) has a decomposition B = 13 @ Q 
into a vector subspace Q ^ P = P[E] and some affine subspace B c R^ such that E 
is strictly convex on int 5 (domi?|^). 

Moreover inig{Aom.E\A) = ri(domi?|^) and int^(domi?|^) = ri(domi?|^) are nonempty 
sets. 
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Proof. Since E is proper and convex we have A 0 and int^(donii?|^) = ri(donii?|^) =|= 
0 by Lemma 4.3.3. Dne to B nri(domi?) =|= 0 the fnnction E\b is still proper and convex; 
so the same Lemma gives also B 0 and intg(dom£'|j 3 ) = ri(domi?|g) =|= 0. 

i) Since P is a snbspace of the periods space P[E] we clearly have P c P[i?]. The reverse 
inclnsion P[E] c p also holds trne: Let p e P[E] and chose any oq e int^(dom i?|^) and 
think of it as new origin. Since E{ao + p) = E{ao) < +oo we have ao + p e domP c 
aff(domi?) = A® P, so that ao + p = a + p, for some d e A and p e P. Hence a — oq = 
p — p E P[E]. The affine combination oq + A(a — Oq) still belongs to A for all A e M and 
hence even to int^(domi?|^) for all snfficiently small chosen A > 0. Choose such a A > 0 
and consider the possibly degenerated line segment co(ao, ao + A(a — ao)) ^ int^(dom Pl^). 
On the one hand E is strictly convex on the latter set and hence on our line segment. On 
the other hand d — ao e P[E] means that E is constant on this line segment. Both can be 
true only if our line segment is degenerated to one single point, i.e. if ao = ao + A(a — ao). 
This gives 0 = d — ao=p — p, so that indeed p = p e P. 

ii) Let bo E B Ci ri(dom£') = ri(domi? n I?) = ri(domP| 5 ), where we used Theorem B.IO. 
Without loss of generality we may assume bo = 0; otherwise we could replace E by E{- — bo) 
without changing the truth value of the other assumptions and assertions of the lemma. 
By Theorem 4.3.8 and the already proven part i) we then have 

Q := P[E\b\ = P[E] n P C P[E] = P. 

Choose now hrstly any subspace P of P with P = P 0 Q, then some subspace Q' of 
P[P] = P with P = Q@Q' and hnally some subspace B' of A with 

A = B 0 (P 0 Q ® Q') ~ B 0 Q 0 Q' ■ 

=:4 =P 

By Theorem 4.3.4 we know that E ■= E\^ and E ■= E\^ are the very same mapping, 
except for a bijective affine transformation d \ A ^ A between their domains of dehnitions, 
which links these functions via E = E o d. Consequently E is strictly convex on a subset 
5 c H if and only if E is strictly convex on (5[>S] =: S. Choosing S ■= intq(domP) = 
intq(domP|^) we see that E is strictly convex on Q![intq(domP)] = int^((5[domP]) = 
int^(domP) = int^(domP|^). So P = B0Q would give the needed decomposition, 
if int^(domP Ib) — int^(domP|^) can be verihed. Due to P c yf it suffices to show 
int^(domP|^) = ri(domP) n P and int^(dom P|y;) = ri(domP) n A. In order to prove 
the hrst equation we note that P intersects ri(domP) in bg = 0 so that equation (B.7) in 
Theorem B.IO gives aff(domP|^) = aff(domP n P) = aff(domP) n P = P. Therefore 
and by equation (B.4) in Theorem B.IO we indeed get int^(domP|^) = ri(domP|^) = 
ri(dom P n P) = ri(dom E) n P. Just analogously we obtain intq(dom P|q) = ri(dom E) n 

A. □ 

Theorem 4.3.16. Let E : M” —>■ Mu {+oo} he proper, convex as well as lower semicontin- 
uous and let M : M”^ ^ M” he a linear mapping whose range TZ{M) intersects ri(domP). 
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Assume further that there exists a decomposition 

aff(dom E) = Ae © Pe 

of aS{domE) into a subspace Pe of P[E] and an affine subspace Ae ^ M" such that E is 
strictly convex on int^ (domi?|^ ). Then the function E ■= E o M : » R u {+ 00 } is 

again proper, convex and lower semicontinuous and there exists a decomposition 

aff(domF) = Ae © Pe 

o/aff(domF) into a subspace Pe of P[E] and an affine subspace Ae ^ R”^ such that E is 
strictly convex on mt^^(domF|^^). 

Remark 4.3.17. Note that Lemma 4.3.3 implies that all sets that occur in the above 
theorem are nonempty. 

Proof of Theorem 4-3.16. The mapping F = E o M is surely again convex and lower semi¬ 
continuous. Due to TZ{M) n domi? ^ 77(M) n ri(domi?) 7 ^ 0 it is also again proper. 
Since R™' = TZ{M*) ©A/'(M) and since clearly Af{M) c P[F] we have 

domF = domF| 7 ^(M*) ©A/'(M). 

It suffices to prove that there is a decomposition 

aff(domF|7e(M*)) = Ae®Qe (4-18) 

with a subspace Qe ^ PiPlniM*)] and some affine subset Ae ^ R™, such that F is strictly 
convex on int^^(domF|^^), since this decomposition then yields, by virtue of equation 
(B.ll) in Theorem B.15, the needed decomposition 

aff(domF) = aff(domF|7^(M*) ©A/”(M)) 

= aff(domF|7j(M*)) ©A/"(M) 

= Ae ® Q F ® 

-V-^ 

=:Pf 

note herein that not only is a subspace of F[F] but also Qf'- Let q e Qe ^ 

PlFlniM*)] and write every x' e aff(domF) in the form x' = a'+q'+n' with a' e Ae, q' e Qf 
and n' e Af{M) c P[F]. Since a' + q's aff(domwe then indeed obtain 

F{x' + q) = F{a' + q' + q + n') = F{a' + p + q) 

= F{a' + q) = F{a' + q' + n') = E(x') 

for every x' s aff(domF), i.e. q e P[F]. In order to prove that a decomposition as in 
(4.18) really exists we consider the restricted functions P := F\ti(m*), E ■= E\'pi(M) and 
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M ■■= The equation F = E o M then elucidates that F and E are the very 

same mapping - except for the linear homeomorphism M : TZ{M*) —> TZ{M) between their 
domains of dehnition. Hence our task to prove that there is a decomposition as in (4.18) 
is equivalent to prove that there exists a decomposition 

aff(domi?|7j(M)) = Be®Qe 

of aff(domi?| 7 ^(M)) into a subspace Qe ^ P[_E\'ii(^m)\ and some affine subset Be ^ K” such 
that E is strictly convex on int^^(domi?|^^). To this end we set 

B ■■= aff(dom£'| 7 ^(M)) = aff(7^(M) n domi?) = TZ{M) n aff(domi?) c aff(domi?) =: A, 

where we have used again equation (B.7). The decomposition A = Ae ® Pe fulhlls the 
assumption of Lemma 4.3.15. Part ii) of this lemma gives now a decomposition 

aff(domT;|7e(M)) = B®Q, 

where H c M" is an affine subset such that E is strictly convex on int^^(domi7|^^) and 
where Q Pe B[E]. Setting Be ■= B and Qe '■= Q we are done, since the demanded 
Q c P[E\ti{m)] really holds true: Due to the banal b + Q B®Q aff(domi7|7^(M)) = B 
for any b e B ^ B we see that Q is a subspace of H’s difference space B — b —■ U. Thereby 
and by Theorem 4.3.8 we now indeed obtain Q = Q nU ^ P\E\ n c PYE\b\. □ 

4.3.4 Existence and direction of argniin(E' + G) for certain classes 
of functions 

The next lemma gives a necessary criterion in order to ensure that a function of the form 
E + G has a minimizer. The core of the proof consists in showing that the convex function 
E + G has a bounded nonempty level set, i.e. is coercive. The inequality from Lemma A.2 
helps in part ii) 

Lemma 4.3.18. Let M” be decomposed as direct sums MA = Ui ® U 2 and = Vi 0 V 2 
of vector subspaces Ui,U 2 and Vi,V 27 respectively. Let F,G s ro(R") be functions which 
inhere the translation invariances 

E{x) = F{x + U 2 ), 

G{x) = G{x + V 2 ) 

for all X E R"', U 2 e U 2 and ^2 e V 2 - Then the following holds true for levels a, fl e R.- 

i) levQ(F) n lev/ 3 (G) is empty or unbounded, if U 2 n V 2 ^ {0}. 

ii) levQ,(F) n lev^(G') is hounded (possibly empty), if U 2 r\ V 2 = {0}, and levQ,(F|; 7 j, 
lev/ 3 (G|yj) are hounded. 
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iii) F + G takes its minimum in i/dom F n dom G =|= 0, t/ 2 nV 2 = {0} and leVa(F|( 7 j, 

lev^(G|yJ are nonempty and bounded. Moreover the set argmin(F + G) of minimizers 
is compact in this case. 


Proof. We use the abbreviations / := F\u^ and g ■= G\v^. 

i) Since in case of levQ(F) n lev/ 3 (G) = 0 there is nothing to show, we assume that there 
is a e leVct(F) n lev^(G). Choose any Z 2 ^ U 2 r\ V 2 with Z 2 =|= 0. Due to Zi + \z 2 e 
leVo(F) n lev^(G) for all A e R, a whole affine line is contained in leVa(F) n lev/ 3 (G). So 
the latter set is unbounded. 


ii) Let U 2 r\ V 2 = {0} and let leVa(/), levying) be bounded. If the set leVa(F) n lev/ 3 (G) 
was unbounded, it would contain an unbounded sequence of points z^^\ k e N. Due to 
levQ,(F) = levQ,(/) 0 U 2 and lev/ 3 (G) = levi 3 {g) 0 V 2 the z^^'> could be written in the form 
z{k) = ^ ^(fc) components e levQ,(/), e lev^( 5 '), forming 

bounded sequences, and second components e U 2 , e V 2 , forming unbounded 
sequences. Lemma A.2 ensures that there is a constant G > 0 such that ||w 2 ~ "^2 II ^ 
G”^||n 2 ^^|| for all k e N. The unboundedness of the sequence (Hn^^^lDfcgN along with the 
boundedness of the sequences (||'u[^^|| 2 )fc£N and (||n|^^|| 2 )fcGN would therefore result in 


0 = IIII 2 = II ^ II — Uo^^l 


u 


(k) „Xk), 


^ G 


-ii 


u 


F)\ 


-(I'Aib + i-rib) 


(fc)i 


+00 


- a contradiction. 

iii) Since the level sets levQ,(/) and lev^( 5 ') of the proper, convex and lower semi continuous 
functions /, g are nonempty and bounded we know that all level sets of / and g are 
bounded, cf. [19, Corollary 8.7.1]. Since domF n domG =|= 0 there are levels d,/? e R 
with levQ,(F) n lev^(G) =|= 0. The bounded sets levQ,(/) and lev^( 5 f) are nonempty, due 
to levQ,(/) 07/2 = levQ,(F) =|= 0 and lev^( 5 ') 0 V 2 = lev^(G) =|= 0. Consequently / and 
g are bounded from below, see Detail 18. Without loss of generality we may therefore 
assume / ^ 0 and ( 7^0 (otherwise we can set m/ := in0jg[/j f{ui), 'kUg ■■= inf^^gy^ hki) 
and replace /, F, a and o', G, /3 by / — mf, F — m/, a — mf and g — mg., G — mg, 
/3 — mg, respectively), i.e. F ^ 0 and G ^ 0. Next we show that lev-_^^(F + G) is a 
nonempty compact set. We have lev-_^^(F + G) ^ levQ,(F) n lev^(G) =|= 0. Furthermore 
lev-^^(F + G) is closed due to being a level set of a lower semicontinuous function. Lastly 
lev(F + G) c lev-_|^^(F) nlev-_^^(G) is bounded by (ii), since the needed boundedness of 
lev-_^^(/) and lev^^^( 5 ') is only a special case of the already mentioned level boundedness of 
/ and g and therewith ensured. Hence lev^^^(F + G) is non empty and compact. Therefore 
the (proper) lower semicontinuous function (F + G)|iev.^^(F+G) = T + G + qev-^^(F+G) must 
be minimized by an -u e lev-^^(F + G), see [20, 1.10 Corollary] or Theorem 2.5.11, which 
clearly also minimizes F + G. Finally we set 7 := F('u) + G{u) e (— 00 , 0 : + [3] and note 
that argmin(F + G) = lev^(F + G) is a closed subset of the compact set lev-_^^ (F + G) 
and hence itself compact. □ 
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Next we are interested in the direction of argmin(F + G). We will see that - under certain 
assumptions - we have (argmin(F + G) — argmin(F + G)) c P[F], which is the core 
ingredient to see that F and G are constant on argmin(F + G). 

Lemma 4.3.19. Let the Euclidean space M” be decomposed into the direct sum M” = Ui@U 2 
of two subspaces Ui, U2 and let F : M” —>■ M u {+00} be a convex function which inheres 
the translation invariance F{x) = F{x + U 2 ) for all x e M” and U 2 ^ t/ 2 . Furthermore, let 
G : M” —>■ M u {+00} be any convex function. Then the following holds true: 

i) //domFndomG =1= 0 and F is strictly convex on Ui then all x,x e aTgmm,,,^^n{F{x) + 
G{x)} fulfill X — X e U 2 and F{x) = F{x), G{x) = G{x). 

ii) //ri(domF) n ri(domG) =|= 0 and F is essentially smooth on Ui and strictly con¬ 
vex on ri(domF n Ui) then argmin 3 ,gRn(F(a;) + G(x)) Q ri(domF) and all x,x e 
argmin^g]g„{F(x) + G(x)} fulfill x — x e U2 and F{x) = F{x), G(x) = G(x). 


Before proving this lemma we illustrate that in general we really need to require F to be 
essentially smooth, in order to guarantee the assertions of part ii) 





Figure 4.4: Up: Graph of h and F, Down: argmin(/i) = [—1,1] x {0} and some other level sets of h 


Example 4.3.20. The shifted Euclidean norm /t;, : > M given by hb{x) ■= ||a; — b\\ 2 , 

where 6 e is strictly convex on every straight line which does not meet b, by Lemma B.2. 
Set b = (1, 0)^ and b' = —b = (—1, 0)^ and consider the function h : R^ ^ R given by 

h{x) = hb{x) + hbfx) = ||a; — b \2 + \\x — b'\\ 2 . 
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The only straight line whieh meets both b and b' is aS{{b,b'}) = M x {0}. Therefore h 
is strietly eonvex on all other straight lines in c.f. also Figure f.f. In particular h is 
strictly convex in the open upper half plane HI := {x e : X 2 > 0}. Set Ui ■= U 2 ■= {0} 
and consider the functions F, G:M^^Mu{+oo} given by 


F{x) 


h{x) + Xi for X e El 

+00 for X e 


G{x) ■■= —xi 


Then all general assumptions of Lemma 4.3.19 are fulfilled just as the assumptions of part 
ii) - except that F is not essentially smooth on Ui = Mf; note here that h is continuously 
differentiable in 6'}, so that choosing any boundary point x e 5domF = dH = M x 

{0}, which is different from b and b', we/ixxe lim„^oo ||VF(x „)||2 = || V/i(x) + (l, 0)^||2 =|= +00 
for any sequence (xn)neN in int(domF) = H, which converges to x. 

We have argmin{F + G} = argmin/i = [—1,1] x {0} here, so that argmin{F + G} n 
ri(domF) = 0. Moreover the minimizers x = (1,0)'^ and x = (—1,0)'^ neither fulfill 
X — X E U 2 nor F(x) = F{x), G(x) = G(x). 


Proof of Lemma 4.3.19. i) First we prove that for any x,y e dom F and the line segment 
/(x, y) ■■= {x + t{y — x) : t e [0,1]} the following statements are eqnivalent; 

a) F\,, , is constant, 

b) is affine, 

c) y — X E U 2 . 


We use the unique decompositions x = Xi + X 2 , ?/ = 2/1 + 1/2 with xi, yi e Ui and X 2 , 2/2 ^ U 2 . 

a) ^ b); This is clear since a constant function is in particular an affine one. 

b) ^ c); If is affine, i.e., 

F{x + t{y — x)) = F(x) + t{F{y) — F{x)) for every t e [0,1], 
the translation invariance of F yields 

F(x + t{y - x) - X 2 - t{y 2 - X 2 )) = F(x - X 2 ) + t{F{y - 1 / 2 ) - F{x - X 2 )), 

F(xi + t{yi - xi)) = F(xi) + t{F{yi) - F(xi)) for every t e [0,1], 

so that is affine as well. On the other hand F is also strictly convex on /(xi, yfi. 

Both can be simultaneously only true, if Xi = yi, which just means that y—x = y 2 —X 2 e U 2 . 

c) ^ a): Let y — x e U 2 , i.e. yi = Xi, so that y — x = y 2 — X 2 . Therefore and due to the 
translation invariance of F we get 

F(x + t{y — x)) = F(x + t{y 2 — X 2 )) = F(x) 
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even for all t e M. In particular F is constant on l{x, y). 

Now the assertions of part i) can be seen as follows: Due to the convexity of F+G the whole 
segment l{x,x) belongs to argmin{F + G] so that F + G is constant on l{x,x). Thus, the 
convex summands F and G must be affine on l{x, x) c dom(F + G). Now the equivalence 
b) c) tells us that x — x = —{x — x) e U 2 and hence F{x) = F{x). The remaining 
G{x) = G{x) follows from the last equation and from F{x) + G{x) = F{x) + G{x) since 
only hnite values occur. 

ii) The function / := F\u-^ : Ui —>■ Mu {+ 00 } is essentially smooth, so that intt/^(dom/) is in 
particular a nonempty subset of Ui. Therefore and by Theorem B.15 we get aff(domF) = 
aff(domF|c/j 0 U 2 ) = aff(domF|t/J ® U 2 = Ui ® U 2 . Lemma 4.3.11 now says that F is 
essentially smooth on aff(domF). The therewith applicable part i) of Lemma B.6 gives 
argmin(F + G) c ri(domF). Hence the minimizers of F + G keep unchanged, if we enlarge 
the values F{x) outside of ri(domF) by setting 


F{x) 


F{x), for a; e ri(domF) 
+00, for a; ^ ri(domF) 


Hence we get the remaining assertions for x,x e argmin(F + G) = argmin(F + G) by 
applying part i) to F and G; note herein that domF n domG = ri(domF) n domG =|= 0, 
that F is still convex, see Theorem B.8, and strictly convex on Ui, since F is by assumption 
strictly convex on ri(domF n Ui) = dom(F|; 7 j), and that hnally U 2 still belongs to the 
periods space F[F], since ri(domF) = ri(dom F|[/j 0f/2) = int;7^(dom/)0f/2, by Theorem 
B.15. □ 


Theorem 4 . 3 . 21 . Let F, G : M'^ ^ M u {+00} be convex functions with ri(domF) n 
ri(domG) =|= 0. If there is a decomposition 

aff(domF) = A® P 

o/aff(domF) into a subspace P of P[F] and an affine subspace A c M"^ such that F is 
essentially smooth on aff(domF) (or on A) as well as strictly convex on intq(domF|^) 
then 


and 


argmin(F(a;) + G(a;)) c ri(domF) 

xsR" 


X — X E P, 


F{x) = F{x), 
G{x) = G{x) 


for all x,x E argmin,j,g]R„(F(a:) + G(a:)). 


Proof. Let a e ri(domF) n ri(domG). Replacing F and G by Fi(-) = F(- — a) and 
Gi(-) = G(- — a), respectively would neither change the truth value of the assumptions 
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nor of the assertions; therefore we may without loss of generality assume a = 0, so that 
aff(domF) is a vector subspace of M”. Write 0 = oq + po with some uq ^ A and po ^ P- 
Due to = -FU+po replacing A by the vector subspace A 2 = A + pq 

would neither change the truth value of the assumptions nor of the assertions; therefore 
we may without loss of generality furthermore also assume that ^4 is a vector subspace of 
aff(domF). Set now Ui ■= A and U 2 ■= P. Noting that neither the truth value of the 
assumptions nor of the assertions changes when considering F, G and F + G only on the 
vector space Ui@U 2 = aff(domF) and identifying it with some M” we obtain all claimed 
assertions by part ii) of Lemma 4.3.19; note here that ri(domF n Ui) = ri(domF) nUi = 
int[/^(domF|f/J, ri(dom G'|afr(domF)) = ri(domG n aff(domF)) = ri(domG) n aff(domF) 
by Theorem B.IO, and note dually that F is in any case essentially smooth on aff(domF) 
by Lemma 4.3.11. □ 

Remark 4.3.22. 

i) The assumptions of the just proven theorem can be only valid if in fact P = P[F]. 

ii) The essentially smoothness as well as the strictly convexity assumptions on F keep 

valid if A is replaced by any other affine subset A c with A® P = aff(domF) = 

A®P. 

Proof, i) Since P is a subspace of P[F] we have P c P[F]. For the proof of P ^ -P[-^] 
we may assume without loss of generality that the affine space aff(domP) is even a vector 
subspace of with origin 0 e domP. Then every arbitrarily chosen p e P[F] belongs to 
aff(domP) and can therefore be written in the form p = d + p with some d e A, p e P. 
Hence d = p — p e P[P], i.e. F{x + Xd) = F{x) for all a; e and all A e M. Choosing 

any element d from the nonempty set ri(domP) n H = ri(domP|^), see Theorem B.8 and 
Theorem B.IO we have in particular P(a + Xd) = F{d) for all A e M. This is only possible 
for a = 0, since P is strictly convex on {a + Aa : A e R} c ri(domP) n H = ri(domP|^) = 
int^(domP|^), where we have used Theorem B.IO. Consequently p = d + p = pEP. 
ii) Writing 0 = do+po, 0 = do+po and noting P|q = F\a+po, P\a = P\a+po without 

loss of generality assume that A and A are vector subspaces. Consider the projection 
TT : A ^ A, X = d + p t—> a of the vector space A = H0P = H0P onto its subspace A. We 
have Af{7r) = P, so that a ■= 7r|q : A ^ A is a vector space isomorphism, which links P|q 
and P|^ both via P|q = P|^ o a and its consequence int^(domP|^) = a[intq(domP|q)]. 
Therefore P|q is essentially smooth if and only if P|^ is essentially smooth. Writing 
O ■= intq(domP|^) and O ■= int^(domP|^) we likewise have that F\q is strictly convex 
if and only if F\q is strictly convex. □ 

4.4 Homogeneous penalizers and constraints 

This section is divided into two subsections. In the hrst subsection we restrict the broad 
setting of the Section 4.2 to a less general setting by making a particular choice for 4/ and 
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by putting some assumptions on $. In Lemma 4.4.1 we show some implications of the 
assumptions on $ for $ itself and its conjugate function $*. In Remark 4.4.2 we will see 
that the Fenchel Duality Theorem 4.2.11 can be applied within our setting. The second 
subsection deals with properties of the minimizing sets. In Theorem 4.4.3 we show that 
the problems (T’i,r), (-P 2 ,a), (Di t-), (D 2 ,a) have a solution for r > 0 and A > 0, if certain 
conditions are fulhlled. In Theorem 4.4.4 we prove that under the same conditions and 
an extra condition there are intervals (0,c) and (0,4) such that SOL(Pi^t-), SOL(P 2 ,a), 
SOL(Di T-), SOL(D 2 ,a) show similar localization behavior for r = 0, A e [d, +oo); r e (0, c), 
A e (0, 4); and r e [c, +oo), A = 0. In Theorem 4.4.6 the localization behavior is rehned for 
r e (0, c) and A e (0, 4). The results there say that, while r runs from 0 to c and A runs 
from 4 to 0, all solver sets have to move. Moreover the mappings, given by r i—> SOL(Pi^t-) 
and A •—> SOL(P 2 ,a) are the same - besides a (“direction reversing”) parametrization change 
g : (0, c) —>■ (0, 4). Similar the mappings, given by r > SOL(iAi,-) and A i—> SOL(D 2 ,a) are 
the same - besides the same parametrization change g : (0, c) —>■ (0,4). In the remaining 
parts of that subsection we deal with g. 


4.4.1 Setting 

In the rest of this thesis, we deal with the functions 


— be,;i|M| and T 2 == 


where || ■ || denotes an arbitrary norm in with dual norm || • := max||a;||^i(-,x). 

Constraints and penalizers of this kind appear in many image processing tasks. Note that 
4/i(r“^x) = iievr\\-\\{^) = Tdevriwi^) T G (0, +oo). The conjugate functions of Ti and T 2 
are 

Tt = ||-||=, and 

and their subdifferentials are known to be 


r {0} if i|x|| < 1, 

44/i(x) = ■( {p e M’” : (p, x) = IIpII;^} if ||x|| = 1, (4.19) 

[ 0 otherwise 


and 


I {psW^: ||p||h= < 1} if ||x|| = 0, 

[ {p e : (p, x) = ||x||, ||p||h= = 1} otherwise. 


(4.20) 


Then the primal problems (P) in (4.10) with p := r ^ > 0 in the case 4/ = Ti and 
p := A > 0 in the case 4/ = 4/2 become 


(Pi^^) argmin {<I>(x) s.t. ||Lx|| < r} , 
(P 2 ,a) argmin {<h(x) + A||Lx||} 

xeR" 
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and the dual problems (D) in (4.11) read 

argmin{$*(-L*p) + t\\p\\:,} , 

psR™ 

(D2,a) argmin{$*(-L*p) s.t. ||p||h= < A} 

psR™ 

We will also consider the cases r = 0 and A = 0. In what follows we will assume that 
Fp := $ : M” —>■ M u {+oo} and Fn ■■= $*(—L*-) : > M u {+oo} are invariant 

under translation in direction of subspaces Up ^2 and 17 d, 2, respectively. Speaking now in 
terms of a general function F : ^ M u {+oo} we could of course always make the 

uninteresting choice U 2 '■= {0}; so more precisely we are interested in those decompositions 
= Ui ® U 2 with F{u + U 2 ) = F{u) for all u e U 2 e U 2 , in which U 2 is chosen as 
large as possible, so that the essential properties of F can be revealed by considering F|p^. 
In case of aff(dom Fl^J = Ui we do not need to rehne the decomposition = Ui ® U 2 
and can think of F to be essentially given by / = F|p^. In case of aff(domF|[/j) c 
Ui, however, it can be convenient to rehne the decomposition R” = Ui ® U 2 by writing 
aff(domF|pj) = a + Xi with some a e aff(domF|pJ and a vector subspace Xi c R^; after 
choosing some vector subspace X 3 with Ui = a + Xi ® X 3 and setting X 2 ■= U 2 we have 
R"" = a + Xi@ X 2 ® X 3 and can think of F to be given essentially by F\a+Xii since the 
inclusion domF a + Xi ® X 2 just means that F{x) = F{a + xi + X 2 + X 3 ) equals +00 
for a ;3 =1= 0 and F(a + Xi + X 2 ) = F{a + Xi) for x^ = 0. 

In those cases where 0 e aff(domFlpJ or where F is replaceable by F(- — a) we can even 
assume a = 0 so that we have R"" = Xi 0 X2 0 X3 and can think of F to be given in 
its essence by F|xi on Xi, then extended to a larger subspace Xi 0 X2 by demanding 
translation invariance in direction X2, and hnally set to +00 on R"^\(Xi 0X2). This is the 
core structure, which $ will now be demanded to have. In addition Xi, X2 and X3 shall 
be pairwise orthogonal; 

Let <h’s domain R'^ have a decomposition R"^ = Xi 0 X2 0 X3 into pairwise orthogonal 
subspaces such that 


<h(a;) = $(a:i + X2 + x^) = 


if 3:3 = 0 


+00 


if 3:3 = 1 = 0 ’ 


(4.21) 


where 0 = $|xi : Xi ^ R u {+ 00 } is a function meeting the following demands: 


i) dom (f) is an open subset of Xi with 0 e dom (f), 

ii) (j) belongs to ro(Xi) and is strictly convex and essentially smooth (compare [19, p. 

251|), 

hi) (j) has a minimizer. 
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The following lemma shows that the sub differentials of 0 and $ are closely related and that 
<h* is of the same basic structure as <h, whereas the roles of X 2 and X 3 are interchanged. 
Note that for the proof of the hrst two parts we use only the direct decomposition of 
<h’s domain M” into the pairwise orthogonal subspaces Xi, X 2 , none of the additional 
properties of 0 is needed. 

Lemma 4.4.1. For a function fulfilling the setting in (4.21) and any points x,x* e M" 
the following holds true: 


i) 


ii) 


d<h(a;) = d^{xi + X 2 + xfi) ■- 

+ xl+ xl) 


0 

d 0 (xi) 0 {0} 0 X 3 

jrixt) ^/x2* = o 

1+00 ifx^^o’ 


if XsfiO 
if X 3 = 0 . 

where 


hi) • 0 * belongs to ro(Xi) and is essentially smooth and essentially strictly convex 
(compare [19, p. 253J) 

• 0 e int(dom 0 *) and 0 e ri(dom$*) 


Proof, i) and ii) We rewrite in the form $ = <l>i w $2 ® 4 > 3 , where 

4*1 = 0 : Xl — > M u {+00}, $2 = 0^2 • -^2 — * 4^3 = t|o} : X3 — > M u {+00}. 

Since R” = W 0 ^2 0 ^3 is a direct decomposition into pairwise orthogonal subspaces we 
can apply Theorem B.16 and obtain 

d$(a:) = d$i(a;i) 0 d^2{x2) 0 d$3(a;3) = dfiixi) 0 {0} 0 6'3(a:3), 

where S^i^Xs) = 0 for 0:3 =|= 0 and S^i^Xs) = X 3 for 3:3 = 0 , as well as 

$00) = + ^*2(4) + + 0 


hi) 0 e ro(Wi) implies 0* e ro(Xi). Changing the coordinate system via an orthogonal 
transformation x ^ x = Qx changes 0 and 0* in the same way; If 0(5;) = 4>{Qx) then 
also 0*(5;) = (j)*{Qx). Hence [19, Theorem 26.3] can be extended for functions like 0,0* : 
Xl—>Ru{+oo}, which are only dehned on a subspace Xi of R”. So the strict convexity 
of 0 implies that 0 * is essentially smooth and the essentially smoothness of 0 implies that 
0* is essentially strictly convex. In order to prove 0 e int(dom0*) we note that argmin0, 
consisting of just one element, is a nonempty and bounded level set of 0. Consequently 
all level sets levQ,(0), a e R, are bounded, compare [19, Corollary 8.7.1]. This implies 0 e 
int(dom0*) (of course regarded relative to Xi), compare [19, Corollary 14.2.2]. Therefrom 
we hnally obtain 0 e int(dom 0 *) 0 X 3 = ri(dom$*), because dom 0 * 0 X 3 = dom$* by 
part ii). □ 
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Remark 4.4.2. By our setting - in first line by the condition i) on $ - we have 0 e 
dom$ and also 0 e ri(dom$*) by Lemma 4-4-1. Therefore our setting ensures that the 
assumptions i) - iv) of Lemma 4.2.11 are fulfilled: Regarding the first three assumptions 
we note TZ{L) = ri(7^(L)) so that every of these assumptions is of the form 

ri(yl) n n{B) 0 

with convex subsets A, B of some Euclidean space. Both for d' = d'l and ^ = '^2 we have 
0 E A and 0 e int(-B) for sets A,B corresponding to condition i), ii) or Hi) of Lemma 
4 . 2 . 11 , respectively. Since A is in any case convex and nonempty there is some au e ri(yl) 
with Ok —* 0, cf. Theorem B.7. Hence we have also ax ^ int(R) for a large enough K. 
In particular ri(yl) n ri{B) = ri(yl) n int(-B) =|= 0 . Also the fourth assumption of Lemma 
4 . 2.11 is clearly fulfilled in our setting, since 0 e IZ{—L*) n ri(dom<!>*). 

4.4.2 Properties of the solver sets and the relation between their 
parameters 

The next theorem shows that all our problems (-Pi,t), (-P 2 ,a), (-D 2 ,a) have a solution 

for r > 0 and A > 0 if certain conditions on argmin $ and Af{L) = argmin ||L-1| are fulhlled. 

Theorem 4.4.3. Let $ e ro(M"') be a function fulfilling the setting (4.21) and let L e M™’"- 
so that X 2 n A/'(L) = {0} and argmin $ n A/'(L) = 0. Then all solver sets SOL(Pi^t-), 
SOL(Di^^), SOL(P 2 ,a )7 SOL(D 2 ,a) are nonempty for t e ( 0 ,+ 00 ), A e ( 0 ,+ 00 ) and the 
corresponding minima are finite. 

Proof. Note in the following that the requirements i) - iv) of Lemma 4.2.11 are fulhlled. 
Let A > 0. Since <h(—L*-) is lower semicontinuous on the compact Ball B ■■= ]Ba(0)[|| • ||*] := 
{p E : IIpII* < A} we have SOL(iA 2 ,A) =1= 0- The attained minimum is hnite, because 
0 E B Ci dom($*(—L*-)) holds true by part hi) of Lemma 4.4.1. Lemma 4.2.11 ensures 
that also SOL(P 2 ,a) =1= 0 , where the attained minimum is hnite, since dom(<l> + A||L • ||) = 
dom$ =1= 0. Let now r > 0. We get SOL(Pi^,-) =|= 0, by part hi) of Lemma 4.3.18, 
applied to F := $, Ui ■■= Xi ©X 3 , U 2 ■= X 2 and G ■= qev.^||L-||, hi — ^(T*), V 2 ■= A/"(L); 
the assumption of this Lemma are checked in Detail 19. Due to the therein appearing 
relation dom<h n levT-||L • || =|= 0 the attained minimum is hnite. Lemma 4.2.11 gives now 
SOL(Di T-) = 1 = 0 , where the attained minimum is also hnite since dom($*(—L*-) +r|| ■ ||*) = 
dom$*(-L*-) 0 . □ 

Recall in the next theorem that inf 0 = +00 since any m e [— 00 , + 00 ] is a lower bound 
of 0 c [— 00 ,+ 00 ]. The theorem states that there are three main areas where our solver 
sets SOL(Pi ,-) and SOL(P 2 ,a) must be located: either they are completely contained in 
argmin ||L • || = Af{L) or argmin $, or they are located “between” them, in the sense of 
SOL(P,) n A/'(L) = 0 and SOL(P,) n argmin 4> = 0. Similar relations hold true for 
SOL(Di^t-) and SOL(D 2 ,a)- Note that SOL(Di^t-) = 0 can happen in the border case 
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r = 0 as we show in Example 4.4.5. Also notice in the following theorem that OP(<h, \\L- 1|) 
can either be (0, +oo) or [0, +oo) for a function $ which fulfills our setting (4.21). In case 
r ^ OP($, IIL • II) we have to be carefull when regarding the problem 

argmin {<l>(a;) s.t. ||Lx|| < r} , 

xgR" 

since rewriting it to 

argmin{$(x) + iiev,||L.||(a:)} 

xsR" 

is not possible in this case, cf. the table on page 77. 

Theorem 4.4.4. Let $ e ro(M'^) be a function fulfilling the setting (4.21) and let L e 
so that X 2 nJ\f{L) = {0}, A 3 n TZ{L*) = {0} and argmin<h r\J\f{L) = 0 . Then the values 

c ■■= inf ||La;|| = min ||Ea;||, (4.22) 

oiGargmin $ aiGargmin ^ 

{ min ||pL, if argmin <h*(—L*-) =|= 0 

p£argmin<I.*(-L*-) ('423) 

+ 00 , if argmin<l> 0 -L*-) = 0 

are positive. Their geometrical meaning for the primal and dual problems is expressed by 
the equations 

c = min{r e [ 0 , +oo) : SOL(Pi^t-) n argmin$ =|= 0 } 

= min{r e [ 0 , +oo) : SOL(Di_,-) n { 0 } 0 } 

and 

d = inf{A e [0, +oo) : SOL(P 2 ,a) n ^^{L) + 0 } 

= inf{A e [0, +oo) : SOL(P 2 ,a) argmin $*(—L*-) =|= 0 }, 

where the infima are actually minima of the latter two sets, if one of them is not empty. 
Furthermore the value of t allows to locate SOL(Pi t-) and SOL(Pi t-), according to 

SOL(Pi,,) c M{L), SOL(Pi,,) c argmin<h0-P*0 = 0 

j SOL(Pi,,) n Ff{L) = 0 If SOL(Pi,,) n argmin $0-L*-) = 0 
|sOL(Pi,^) n argmin $ = 0y |sOL(Pi,.r) n {0} = 0 
SOL(Pi^t-) c argmin $, SOL(Pi^t-) c {0}, if t e [c, +oo). 

The value of X similar allows to locate SOL(P 2 ^a) o-nd SOL(P 2 ,a); according to 
SOL(P 2 ,a) c A'(L), SOL(P 2 ,a) c argmin[d, +oo) 

f SOL(P 2 ,a) n U{L) = 0 If SOL(P 2 ,a) n argmin $0-L*-) = 0 
|sOL(P 2 ,a) n argmin $ = 0 j ’ 1801(^2, a) n { 0 } = 0 

SOL(P 2 ,a) ^ argmin $, SOL(P 2 ,a) ^ {0}, i/A = 0. 



, i/re(0,c) 
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Proof. In the proof we use the abbreviations Mr{a) ■■= ]Br(a)[il • ||] and lB*(a) := Br(a)[|| ■ ||;,=]. 

1 . c is really a minimum; We need only to show that the function /-argmin($) + 11-^ ■ II attains 
somewhere in M” its minimum. In order to apply part hi) of Lemma 4.3.18 we decompose 
M"" into the orthogonal subspaces Ui ■= Xi © W 3 , U 2 ■= X 2 and W := Tl{L*), V 2 ■= M{L)., 
respectively and set F ■= targmin($) and G := || L • ||, respectively; then all assumptions are 
fulhlled for certain a, /3, see Detail 20, so that targmin($) + ||G-1| attains indeed its minimum. 

2. d is really a minimum if argmin $*(—L*-) =|= 0 : Let po ^ argmin $*(—L*-) and set 

r := IIpoII*- The set argmin$*(—L*-) is closed, due being a level set of the lower semicon- 
tinuous function $*(—L*-). Hence C ■= argmin $*(—L*-) is a nonempty compact set, 

which must provide a minimizer p e argmin <!>*(—L*-) for the continuous function || • ||=h|c- 
Clearly we also have ||p||=H = infpsargmin$*(-L*-) ||p||*, since ||p||=H ^ r = ||po||* ^ ||p||* for all 
pe argmin <h*(—L*-)\B*. 

3. Next c > 0 and d > 0 are proven, where we consider only the interesting case 
argmin $*(—L*-) =|= 0. We have 

c = 0 min ||Lx|| = 0 

oiGargmin 

e argmin$ : ||Li;|| = 0 
argmin $ n M{L) =|= 0 . 

Since c ^ 0 this just means c > 0 argmin $ nM{L) = 0 , so that we really obtain c > 0. 
Using some calculus from Convex Analysis we obtain 

d = 0 ^ argmin $ n M{L) =|= 0 , 

see Detail 21. Due to d ^ 0 this just means d > 0 argmin $ n A/'(L) = 0, so that also 
d > 0 . 

4. In order to verify that the different views on c and d are really equivalent, we set 


and 


T ■■= {t E [0, + 00 ) : 3x0 ^ argmin(<I>) : r = ||Lxo||}, 

Tp ■■= {r E [0, + 00 ) : SOL(Pi^t-) n argmin $ =|= 0}, 

Td ■■= {r e [0, +(») : SOL(Di,,) n {0} + 0} 

A := {A e [0, + 00 ) : 3po e argmin: A = ||po||*}, 
Ap := {A e [0, +(») : SOL(P2 ,a) n Af{L) + 0}, 

Ap := {A e [0, + 00 ) : SOL(iA 2 ,A) n argmin <!>*(-L*-) 0}, 


respectively, and show that 

T- U {IILioll}, 

and 

U {boll*}, 

(xo,po)elA 


Tp — Tp, — [||Lxo||, + 00 ) 

Xo^It 

Ap = Ap = [bo||*,+°o), 

{xo,po)elA 
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respectively, where It ■= {xq e M” : 0 e 5<h(a;o)} and I\ ■■= {(xo,po) e M” x : 
Lxq = 0,xo e d<h*(—L*po)} are some index sets. The above way of representing T, Tp, 
To and A, Ap, Ap, respectively, then elucidates c = minT = minTp = minTp and 
d = inf A = inf Ap = inf Ap, respectively; here the headline of part 2 of the proof ensures 
that the last three inhma are actually minima of the respective sets, if one - an thus all - 
of them is nonempty. For all r e [0, +oo) we indeed have by Fermat’s rule 


reT 

3xo 6 M” : 0 e d<F(a;o) a r = ||Lxo 

^3xoE It: TE {||Lxo||} 

^ r e y {\\Lxo\\} 

xo^It 


T eTp 

3xo e M” : ||Txo|| < r a 0 e d<F(a;o) 
^ 3xo e It ■ \\Lxo\\ ^ r 

T E y [||Txo||, +00) 
xogIt 


and - by using again Fermat’s Rule as well as the calculus for sub differentials, see [19, p. 
222-225], X E d<F*(a;*) x* e d<F(a;) and (4.20) - also 


teTd 

^ ^ • 0 E d($*(—L*-) -t- t|| • ||*)|pQ A po = 0 

^Oed(d>’^(-L=^-))lo + rd||-||*|o 

^ 0 e -Ld<h"(-L"0) + rBi[|| • ||,,] 

3^0 e M” : xo e d<F*(0) a 0 e —Lxq + ]Bt-[|| • ||] 
3xo e M” : 0 e d<h(a;o) a ||Ta;o|| < r 
^ r e y [\\Lxo\\, -Foo) 

xqsIt 


Similar we obtain for A e [0, -l-oo) the equivalences 


A e A 

^3PoeR^:Oe d($*(-L*-))U a A = ||po||. 

^ 3po e M™ : 0 e Ld^*{-L*po) a A = ||po||* 

^ IpoeK^’ • ^0 ^ d^*{-L*po) A Lxo = 0 A A = IIpoII* 
^ 3(xo,Po) e /a : A = ||po||* 

{xo,Po)elA 
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besides 


and 


A e Af) 

^ 3po e : 0 e d($*(-L*-))U a A ^ ||po||. 

^ 3po e M'" : 0 e Ld^*(-L*po) a A ^ ||po||* 

^ 5o6®^’ • ^0 ^ d^*(-L*po) A Lxo = 0 A A ^ IIpoII* 
^ 3(xo,Po) e /a : A ^ ||po||* 

^ A e [||Po||*5 3"0o) 

(xo,po)g/a 

A e Ap 

3x0 e M” : 0 e d($(-) + A||L • ||)|xo a Lxq = 0 
3xo 6 M” : 0 e d$(xo) 3 - AL*d|| ■ |||lxo a Lxq = 0 
3xo 6 M” : 0 e d<l>(xo) + AL*d|| ■ |||o a Lxq = 0 
3xo 6 M” : 0 e d<l>(xo) + L^AB* a Lxq = 0 
^ 3poeK’"’ • Po ^ AB;^ a 0 e d<l>(xo) + L*pq a Lxq = 0 
^ aposK"*’ • ll^'oll* ^ A A —L*pq e d<l>(xo) a Lxq = 0 
^ 3poeK’"’ • ^0 ^ S^*{ — L*Pq) a Lxq = 0 a ||po||* ^ A 
^ 3(xo,Po) e /a : A ^ Ipoll* 

^Ae y [bo||*,+oo)- 
{xo,po)elA 


5. Finally we prove the 16 claimed relations of the theorem. The subset-relations for 
r = 0 and A = 0 are trivially true. In oder to prove the primal relations for r e (0, c) 
and r e [c, -l-oo) we make use of c = min{r e [0, -l-oo) : SOL(Pi^t-) n argmin$ =|= 0}. For 
re (0, c) we directly get 

SOL(Pi^t-) n argmind* = 0; 

this also implies that any x e SOL(Pi^t-), r e (0,c) must fulhll ||Tx|| ^ r > 0. (||Tx|| < r 
would mean that x is a local minimizer of <h, i.e. a global minimizer of the convex function 
<F; so we would end up in the contradictory x e SOL(Pi t-) n argmin$ = 0). So we have 

SOL(Pi,b n AA(L) = 0 

for r e (0,c). Furthermore the above reformulation of c ensures that there is an x e 
SOL(Pi t-) n argmin$ for r = c. Clearly also x e SOL(Pi t-) n argmind* for r > c, so that 
SOL(Pi t-) n argmind* =|= 0 for r e [c, -l-oo). Since no solvers of (Pi,t) can be outside of 
argmin $, as soon as one solver of (Pi,t) belongs to this level set of <h, we even get 

SOL(Pi^^) c argmin $ 
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for r e [c, +oo). 

In order to prove the dual relations for r e (0, c) and r e [c, +oo) we use c = niin{r e 
[0, +oo) : SOL(Di^t-) n {0} =|= 0}. For r e (0,c) this immediately implies 

SOL(Di,,) n {0} = 0 

and SOL(Zi)i^c) {0} =|= 0. The latter means $*(—L*0) + c||0j|* < $(—L*p) + c||p||* for 
all p e M™. For r e [c, +oo) addition of the inequality (r — c)||0||* < (r — c)||p||* yields 
<F*(—L*0) + t|| 0||* < $*(—L*p) + r||p||* for all p e M™. This just means 0 e SOL(i4i^T-) for 
re [c, +oo). We even have 

SOL(Di,.) = {0} 

for r e [c, +oo): Let an additional p e SOL(Zi)i^T-) be given. In order to prove p = 0 it 
suffices to check that Theorem 4.3.21 can be applied to F{-) = $*(—L*-) and G(-) = r|| ■ ||*, 
since this theorem would then give t||p||* = G{p) = G(0) = 0 and hence the wanted 
p = 0. Indeed all assumptions of this theorem are fulhlled; Clearly F and G are convex 
functions with ri(domF) n ri(domG') = ri(domF) =|= 0. Next the needed decomposition 
aff(domF) = is obtained, by using Theorem 4.3.16, see Detail 22. Finally Theorem 

4.3.12 ensures that F = E o M is essentially smooth on aff(domF). So all assumptions of 
Theorem 4.3.21 are really fulhlled. Finally we show 

SOL(Di^t-) n argmin$*(—L*-) = 0 

for r e (0, c): Assume that there is a p e SOL(Ili,-) n argmin <F*(—L*-) for are (0, c). The 
functions F(-) = <F(—L*-) and G(-) = r|| • ||* fulhll the assumptions of Theorem 4.3.21, see 
Detail 23, so that p e argmin(F + G) c intyi(domF). Consider F and G now only on the 
vector subspace A ■■= aff(domF) by setting f ■■= F\a ^ ro(A) and || • ||' := || • ||*|a e Fo(A). 
Since F{x) = +oo ior x ^ A we still had p e argminpg^(/(p) + r||p||') and p e argmin^g^ fip)- 
The function / : A M u {+oo}, beeing essentially smooth by Lemma 4.4.1 and Theorem 
4.3.12, would be differentiable in p e int^(domF) = intA(dom/). By Theorem B.5 and by 
Fermat’s rule we had df{p) = {0}. Using Fermat’s rule and the calculus for sub differentials 
we hence obtained 0 e d{f + r|| ■ ||')|p = df{p) + rd|| ■ ||'|p = rd|| ■ ||'|p. The already proven 
SOL(Di,^) n {0} = 0 says p =1= 0, so that equation (4.20) implied the contradictory 
0 ed||-|hpC§,[(||.||%]. 

In order to prove the primal relations for A e (0,4) we make use of 4 = inf {A ^ 0 : 
SOL(P2 ,a) Af{L) =1= 0}, while for proving the primal relations for A e [4, +oo) we may 
assume 4 < +oo, i.e. 4 = min{A ^ 0 : SOL(P2 ,a) Af{L) =|= 0}, since in the vacuous 
case 4 = +oo, meaning [4, +oo) = 0, there is nothing to show. For A e (0, 4) we then get 
immediately 

SOL(P2 ,a) nW(L) = 0 

and for A = 4 we get SOL(P2,d)i^A/'(L) =|= 0. The latter means $(x)+4||La;|| < <h(a;)+4||Lx|| 
for all X e SOL(P2,d) n A/'(L) and x e M”. For A ^ 4 adding (A — 4)||La;|| ^ (A — 4)||Lx|| 
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hence gives $(x) + A||Lx|| < <l>(a;) + A||La;|| for all x e S 0 L(P 2 ,d) n Af(L) and x e M”, so 
that we have S 0 L(F 2 ,a) -^{L) =|= 0 for all A e \d, +oo). We even have 

S 0 L(P 2 ,a) c U{L) 

for A e [d,+oo): Choose any x e S 0 L(P 2 ,a) and consider an arbitrarily chosen 

X e S 0 L(P 2 ,a)- In order to prove Lx = 0 it suffices to check that Theorem 4.3.21 can be 
applied to F = $ and G(-) = A||L • ||, since this theorem would then give A||Lx|| = G{x) = 
G{x) = 0 and hence the needed Lx = 0. Indeed all assumptions of this theorems are 
fulhlled, see Detail 24. Finally we show 

SOL(P 2 ,a) n argmin <F = 0 

for A e (00): It clearly suffices to show that any minimizer of <h can never belong to 
SOL(P 2 ,a) for any real A > 0. To this end £x A e (0, +oo) and let an arbitrary x e argmin $ 
be given. Regard $ and $(•) + A||L • || only on span(x) by considering the functions 
/, h : M -> M u {+oo} given by f{t) ■= <h(fx) and h{t) ■= <F(tx) + A||L(fa;)|| = ^{tx) + m|t|, 
where m ■= A||Lx|| > 0 due to the assumption argmin $ n Af{L) = 0. $ is proper, 
convex, lower semicontinuous and essentially smooth on the affine hull of its effective 
domain of definition. These properties carry over to /, see Detail 25. Since 1 e R is 
clearly a minimizer of / we obtain, using part ii) of Lemma B. 6 , that / is differentiable 
in 1 e R with derivative f'{l) = 0. Hence also h is differentiable in 1 with derivative 
h'{l) = f'{l) + m = m > 0. Consequently there is an e > 0 such that h{l — e) < h{l). Its 
rewritten form $ ((1 — e)x) + A||L(1 — £)x|| < <I>(x) + A||L£|| shows that x is not a minimizer 
of SOL(P 2 ,a). 

In order to prove the dual relations for A e (0, d) we make use of d = inf{A ^ 0 : 
SOL(D 2 ,a) argmin <I>*(—L*-) =|= 0 }, while for proving the dual relations for A e [d, +oo) 
we may assume d < +oo, i.e. d = min{A ^ 0 : SOL(D 2 ,a) argmin =|= 0 }, since 

in the vacuous case d = +oo there is again nothing to show. For A e (0, d) we then get 
immediately 

SOL(D 2 ,a) n argmin = 0 ; 

this also implies that any p e SOL(D 2 ,a) with A e (0, d) must fulfill ||p||=H ^ A > 0. 
(||p||h= < A would mean that p is a local minimizer of $*(—L*-) and hence a global mini¬ 
mizer of this convex function; so we would end up in the contradictory p e SOL(D 2 ,a) 
argmin= 0 ). So we have 

SOL(D 2 ,a) n { 0 } = 0 

for r e (0,d). Furthermore the above reformulation of d ensures that there is an p e 
SOL(D 2 ,a) n argmin for A = d. Clearly also p e SOL(D 2 ,a) n<h*(—L*-) for A > d, 

so that SOL(D 2 ,a) n argmin <I>*(—L*-) =|= 0 for A e [d, -l-oo). Since no solvers of {D 2 ,\) can 
be outside of argmin <F*(—L*-), as soon as one solver of (D 2 ,a) belongs to this level set of 
$*(—L*-), we even get 

SOL(D 2 ,a) c argmin 

for A e [d, -l-oo). □ 
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Now we give the announced example, showing that SOL(Di^^) = 0 can happen in the 
border case r = 0 . 

Example 4.4.5. The particular choice 


<h(a;) := (j){x) ■■= 


x - 1 + log i 

+ 00 


for a; > 0 
for a; < 0 


gives a functions <l> : R —» M u {+oo} that fulfills the requirements of our setting along with 
the identity matrix L ■= (1) and || • || = | • |. The conjugate function $*:R—»Ru{+oo} 
can explicitely be expressed as 


$00 


— log(l—p) for p < 1 
+00 for p ^ 1 ’ 


cf. [5] or [3, p. 50f]. Here clearly the proper function $* is not bounded below so that 
SOL(Di^o) = — argmin$* = 0 . 


The following theorem specihes the relations between (-Pi,t); {P 2 ,\), (Di t) and (-D 2 ,a) for 
the special setting in this section. We will see that for every r e (0,c), there exists a 
uniquely determined A such that the solution sets of {Pi,t) and (P 2 ,a) coincide. Note that 
by the Remarks 4.2.8 and 4.2.9 this is not the case for general functions $, T e ro(R”^). 
Moreover, we want to determine for given r, the value A such that {P 2 ,x) has the same 
solutions as Note that part i) of Theorem 4.2.6 was not constructive. 

Theorem 4.4.6. Let $ e ro(R”) be of the form (4.21) and let L e such that X 2 n 
A/’(L) = {0}, X 3 n Tl{L*) = {0} and argmin$ n ^{L) = 0. Define c by (4.22) and d by 
(4.23). Then, for r e (0,c) and A e (0,d), the problems {Pi,t), iP 2 ,\), (-Dpr), (-^ 2 +) have 
solutions with finite minima. Further there exists a bijective mapping g : (0, c) —» (0, d) 
such that for r e (0, c) and A e (0, d) we have 


SOL(Pi ,0 = SOL(P2 ,a) I 
SOL(Pi0 = SOL(D2,a)J 


if {r,\)Egig 


and for r e ( 0 , c), A e [0, + 00 ) or A e (0, d), re [0, + 00 ), 


SOL(Pi,0 n SOL(P2 ,a) = 0 1 
SOL(Pi,0 n SOL(P2 ,a) = 0 ) 


For (r. A) e gr p any solutions x and p of the primal and dual problems, resp., fulfill 


T = \\Lx\\ and A = ||p0 
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Proof. Note in the following that the requirements i) - iv) of Lemma 4.2.11 are fulhlled 
for r e (0, +oo) and A e (0, +oo). 

Theorem 4.4.3 ensures that all solver sets SOL(Fi^^), SOL(F 2 ,a), SOL(iAi T-), SOL(iA 2 ,A) 
are not empty for r e (0, c) and A e (0, d) and that only hnite minima are taken. 


The core of the proof consists of two main steps: In the hrst step we use Theorem 4.2.11, 
Theorem 4.3.21 and Theorem 4.2.6 ii) to construct mappings g : (0, c) —» (0, d), / : (0, d) 

(0, c) with the following properties: 


Vr e (0, c) : 


SOL(Fi,,) C SOL(P2,5(r)) I 
SOL(Di,,)cSOL(D2,5(.))J’ 


VA e (0, d) : 


SOL(F2 ,a) 

SOL(D 2 ,a) 


SOL(P,,,„|) 1 
SOL(Z),./,„)]■ 


(4.24) 

(4.25) 


In the second step we verify that f o g = id(o,c) and g o f = id(o,d) so that g is bijective and 
(4.24) and (4.25) actually hold true with equality. Finally, we deal in a third part with 
(r. A) ^ gig. 

1. First we show that for all x e M"'\A/”(T), p e M™\{0} and for all A, r > 0 the following 
equivalence holds true: 


^ X E SOL(Pi,,),' 


^ £eSOL(P2,A); 

peSOL(Di,,), 

>■ < 

peSOL(D2,A), 

>^ = \\P\\*^ 


t=\\Lx\\^ 


(4.26) 


We have on the one hand for x e M"'\A/'(T), p e r > 0 and A > 0 the equivalences 


SOL(Pi,.,), pe SOL(Di,,) 

rp e d4/i(r“^Lx), —L*ped^{x) 

Ti(r“^L£) + 4/*(rp) = {t~^Lx,tP}, — L*p e d<h(x) 

||L£|| ^ r, t\\P\\^ = (Lx,p}, - L*p E d<h(x) 

\\Lx\\ = r, t\\P\\^ = (Lx,p}, - L*p e d<h(x) 

^\\Lx\\=t, \\Lx\\\\P\\^ = {Lx^p}, -L*pEd^{x), 

where we used Lemma 4.2.11 in step 1, the Fenchel equality [19, Theorem 23.5] in step 2 
and applied in step 4 the inequality (p,p'} < ||p||||p 1I* for P = Lx, p' = p. On the other 
hand we obtain similar for x e W^\N'{L), p e r > 0 and A > 0 the equivalences 


X eSOL(P2,A), P6SOL(P2 ,a) 

X^^p E d'il)2{XLx), 

'^ 2 {XLx) + 4/2 (A^^p) = {XLx, A”^p), 
^ A||L£|| = {Lx,p), ||p||=, < A, 

^ A||L£|| = {Lx,p), ||p||=, = A, 

^ II.P||4= = A, ||L£||||p||,, = {Lx,p), 


— L*p E d<F(£) 

— L^p E d$(£) 

— L*p E d<F(£) 

— L*p E d<F(£) 

— L^pEd^{x). 
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4. Penalizers and constraints in convex problems 


Adding the conditions A = ||p||h= and r = \\Lx\\, respectively, we see directly that (4.26) 
holds true. Now we can construct the function g on (0, c) as follows: Let r e (0, c) and set 

9i^) ■= WpU 

with any p e SOL(Di^^); this is well dehned by Detail 26. Theorem 4.4.4 assures SOL(Pi T-)n 
A/’(L) = 0, SOL(iAi T-) n {0} = 0 and SOL(iAi .r) argmin $*(—L*-) = 0, so that 

||Lx|| > 0, IItII* > 0, IIpII* < d (4.27) 

for all X E SOL(Pi T-), and for all p e SOL(Pi^t-); see Detail 27 for the last inequality. By the 
second and third inequality in (4.27) we see that g^r) e (0,d), so that g : (0,c) (0,d). 

The wanted inclusions in (4.24) follow now from (4.26), which is allowed to apply, by the 
hrst and second inequality in (4.27). 

The function / on (0, d) is constructed as follows: Let A e (0, d) and set 

/(A) := ||Lx|| 

with any x e SOL(P 2 a); this is well dehned, by Detail 28. Theorem 4.4.4 assures SOL(P 2 a)i^ 
{0 } = 0, SOL(P 2 ,a) ■A/'(L) = 0 and SOL(P 2 ,a) argmin<h = 0, so that 

IIpII* > 0, \\Lx\\ > 0, \\Lx\\ < c (4.28) 

for all p e SOL(P 2 ,a), and for all X e SOL(P 2 ^a); see Detail 29 for the last inequality. 

By the second and third inequality in (4.28) we see that /(A) e (0, c), so that / : (0, d) 
(0,c). The inclusions in (4.25) follow now from (4.26), which is allowed to apply, by the 
hrst and second inequality in (4.28). 

2. First we note that 


SOL(Pi,,) n SOL(Pi,,0 = 0, (4.29) 

SOL(P2 ,a) n SOL(P2 ,a') = 0 (4.30) 

for all distinct r, t' e (0, c) and all distinct A, e (0, d), respectively, cf. detail 30. 

Next we prove the bijectivity of g : (0, c) ^ (0, d) by showing fog = id(o,c) and gof = id(o,d). 

In doing so we will also see that (4.24) actually holds true with equality. Let r e (0, c) be 
arbitrarily chosen and set t' = f{g{T)). Using (4.24) and (4.25) with A = g{T) yields 

SOL(Pi,.) c S0L(P2,,(.)) c SOL(Pi,.0, 

SOL(Pi,,) c SOL(P2,5(r)) c SOL(Pi,,0. 

Since SOL(Pi t-) =|= 0 we must have r = r' in order to avoid a contradiction to (4.29). 
Similarly we can prove for an arbitrarily chosen A e (0, d) and A' := g{f{X)) that A = A', 
see detail 31. 

3. It remains to show SOL(Pi^^) n SOL(P 2 ,a) = 0 and SOL(Pi^^) n SOL(P 2 ,a) = 0 
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4.4 Homogeneous penalizers and constraints 


for these (r, A) e [(0, c) x [0,+oo)] u [[0,+oo) x (0, d)] with (r, A) ^ gr^'. Having The¬ 
orem 4.4.4 in mind, we may restrict us to those (r, A) e (0, c) x (0, d) which are not in 
gr^'. For such r, A we have r =|= and A =|= g(T). By (4.29) and (4.30) we therefore 

have SOL(Fi^t-) n SOL(Pi g-i(;,)) = 0 and SOL(D 2 ,a) SOL(iA 2 ,g(T-)) = 0- Substituting 
SOL(Fi g-i(A)) by SOL(F 2 ,a) and SOL(D 2 ,g(r)) by SOL(iAi^T-) we are done. □ 


Here are some more properties of the function g. 

Corollary 4.4.7. Let the assumptions of Theorem f.f.G be fulfilled. Then the bijection 
g : (0, c) —> (0, d) is strictly monotonic decreasing and continuous. 


Proof. Since decreasing bijections between open intervals are strict decreasing and contin¬ 
uous we need only to show that / = g"^ : (0, d) —>■ (0, c) is decreasing. Let 0 < Ai < A 2 < d 
and Xi e argmin 3 ,g]gn{$(x) 4- Ai4/(x)}, i = 1,2, where 4/(a;) := ||La;||. 

Then we know that Tj = 4/(0), i = 1,2. Assume that 4/(£i) < 4/(0). Then we obtain 
with A 2 = Ai -I- £ and e > 0 the contradiction 

$(0) -4 A24/(0) = <F(0) 4- Ai4/(0) -4 £4/(0) 

^ 4>(a;i) -4 Ai4/(xi) -4 £4/(0) 

> 4>(xi) -4 Ai4/(xi) -4 £4 /(xi) 

= 4>(xi) 4- A24 /(xi). 

□ 


Remark 4.4.8. The function g is in general neither differentiable nor convex as the fol¬ 
lowing example shows: The strictly convex function <F, given by 


4>(a;) := 


{x — 

2{x-3Y + 2 


for X < 2 
for X > 2 


has exactly one minimizer, namely xq = 3. Clearly <h, || • || := | • | and L = 
assumptions of Theorem f.f.d if we set X 2 ■= {0}. For A ^ 0 and r e (0,c) 
have 


( 1 ) fulfill all 
= ( 0 , Xo) we 


argmin{<l>(x) s.t. |x| < r} = {r} =: {x}. 

xsR 


By Theorem f.f.d we have argmin(<l>(-) 4- A| ■ |) = {r} exactly for A = gir). An explicit 
formula for gij) is obtained by applying Fermat’s rule: 0 e d(<h(-)-4 5 '(r)| ■ |)|t-= ({$'(■)}-4 
• Dir = {‘^’^('?‘) + gi'r)}; by rearranging we get 



^2(4-r) 

for 0 < r < 2 

g{T) = -$'(r) = ^ 

4 

for T = 2 


.4(3-0 

for 2 < T < xq 


Obviously g is neither differentiable nor convex. 
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APPENDIX 


Supplementary Linear Algebra and 

Analysis 


Lemma A.l. Let V and W be vector spaces over R. A mapping ip \ V is linear if 

i) ip{v + v') = ip{v) + ipiy') for all v, v' e V, 

ii) (p(tv) = t(p(v) for all V e V and all t e [0,1]. 

Note that only t e [0,1] is required. 

Proof of Lemma A.l. By assumption ip is additive. Moreover ip is also homogeneous; Let 
n e 1/ be arbitrarily chosen. In case t e [0,1] we have ip{tv) = tp>{v) by assumption ii). In 
case t e (1, +oo) application of the same assumption to t' ■■= j s [0,1] and v' ■■= tv s V 
yields ip{tv) = tt'(p{v') = tip{t'v') = t(p{v). Using ip{iv) = i(p{v) for h e U, t e (0, +oo) and 
ip{—v) + p){v) = ip{—v + n) = 99 ( 0 ) = ■ 0 ) = 0(/7(0) = 0 , i.e. ip{—v) = —p^iv) we hnally 

obtain also in case t e (—oo,0) the equation ipitv) = p){—t{—v)) = —tip{—v) = tp{v). □ 

The following Lemma provides a useful inequality, which reflects the fact that a direct 
decomposition X = Ai 0 A 2 of an Euclidean vector space X of hnite dimension can only 
consist of subspaces Xi and A 2 which form a strict positive angle a e (0, ^vr], analytically 
described by 


— 1 < cos( 7 r — a) 


inf 

ftieXi\{0},/i2eX2\{0} 


<^1,^2) 

hl||2||h2||2 


The equivalent inequality inf/jjgXinSi,/i 2 sX 2 nSi(^i) ^ 2 ) > —1 follows indeed easily from the 
inequality of the next theorem for || • || = || • II 2 , see Detail 32. Note however that the above 
inequality and the inequality in Lemma A .2 are in general only true in hnite-dimensional 
spaces. These inequalities do not directly transfer to inhnite dimensional inner product 
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A. Supplementary Linear Algebra and Analysis 


spaces as the example X = span{ei}0span{ei + ^ 62 , ei + ^ 63 , ei + ^ 64 ,... } c l 2 (R) shows; 
recall here that the notation X = Xi ©X 2 still shall mean only an inner decomposition in 
the sense of pure vector spaces without demanding additional properties like (topological) 
closeness on Xi and X 2 . 

Lemma A.2. Let Ai, A 2 be subspaces o/M"' with Xi ri X 2 = {0} and let || • || be any norm 
on M”. Then there is a constant (7^1 such that 

II hi II ^ (7|| /ii + /i21 


for all hi e Ai and /i 2 e A 2 . 

Proof. It suffices to hnd a constant C > 0 for which the claimed inequality holds true, 
since enlarging the constant then clearly keeps the inequality true. In case hi = 0 the 
inequality is fulhlled for any C > 0. Therefore we may assume without loss of generality 
that hi e Ai n Si; note therefore that the following statements are equivalent; 

3(7 > 0 Vhi e Ai\{0} Vh 2 e A 2 : ||hi|| < C\\hi + hsl, 

3(7 > 0 Vxi e Ai n Si Vx 2 e X 2 : ||a:i|| < C\\xi + X 2 \\. 

So we need only to hnd a constant (7 > 0 such that ^ < ||/ii + h 2 || for all hi e Ai n S and 
all h 2 e A 2 . We have 

||hi + h 2 || ^ |||^ 2 || — ll^illl ~ ll^ 2 || — 1^2 

for ||h 2 || ^ 3, on the one hand. The mapping p : (Ai n Si) x (A 2 n B 3 ) —» R, given 
by (p(hi,h 2 ) := ||hi + h 2 ||, is continuous on its compact domain of dehnition. Therefore 
if attains its minimum c = (p{hi,h 2 ) for some hi e S n Xi, h 2 e A 2 n B 3 . Combining 
Xi n X 2 = { 0 } and ||hi|| =|= 0 ensures h 2 =|= —hi, so that c = ||hi + h 2 || > 0 and hence 
||hi + h 2 || ^ c > 0 for all hi e Xi n S and h 2 e X 2 n B 3 on the other hand. In total we have 
||hi + h 2 || ^ min{ 2 , c} > 0 for hi e Xi n S and h 2 e X 2 . Setting (7 := > 0 we are 

done. □ 

Next we introduce the notion of an affine mapping via four equivalent conditions; note 
therein that condition i) can also be demanded for a function / which is dehned only on a 
nonempty convex set. For condition ii) and iii) c.f. also [19, p. 7]. 

Definition A.3. Let A,A' be nonempty affine subspaces o/R” and U, U' c R”- the corre¬ 
sponding vector subspaces that are parallel to A and A', respectively. A mapping f \ A ^ A' 
is called afRne, iff one of the following eguivalent conditions is fulfilled: 

i) /(oi 3" ^(a -2 — oi)) = /(o-i) + t{f{a 2 ) — /(oi)) for all ai, 02 e A and all t e [0,1], 

ii) /(ai + t{a 2 — ai)) = f{ai) + f(/(a 2 ) — /(oi)) for all ai, 02 e A and all f e R, 
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iii) There is a linear mapping (p : U ^ U' such that 
/(fl 2 ) — /(oi) = V?(a 2 — oi) for all ai,a 2 e A, 

iv) There is a linear mapping (p : U ^ U' and a point aoe A such that 
f{a) = /(ao) + 92(0 — oq) for all a e A. 

Remark A.4. The four conditions are really equivalent: 

“iv) ^ iii)”; Let p : U U' be linear and qq e A such that /(a) = /(ao) + p{a — Oq) for 
all a E A. Then we get 

f{a 2 ) - /(ai) = /(as) - /(oq) - [/(ai) - /(oq)] 

= p>{a2 - ao) - ipiai - Uq) 

= pi{a2 — ao — [ai — ao]) 

= p>{a2 - ai) 

for all ai, as e A. 

“iii) ^ ii)”; Using iii) for a[ = ai e A and 02 = 01 + t(as — Oi) e A we get 

/(oi + t{a2 - oi)) - /(oi) = (p{a2 - a'l) = p>{t{a2 - Oi)) 

= tp>{a2 - oi) = ^(/(as) - /(oi)) 

for all oi, Os e A and all t e M, so that ii) holds true. 

“ii) ^ i) ” is obviously true. 

“i) ^ iv)”; Choose any oq e A and set p{u) ■■= /(oo + u) — /(oq) for u e U. Then clearly 

ip : U ^ U' and f{a) = /(oq) + p{a — Oq) for all a e A = oq ® U. It remains to show that 

p) is linear. By Lemma A.l it suffices to show that p> is additive and fulfills p>{tu) = tp>{u) 
for all u E U and all t e [0,1]. In order to prove the latter let u e U be arbitrarily chosen. 
Using i) with oi = oo e A and 02 = ao + u e ao + U = A we obtain indeed 

Titu) = /(oo + tu) - /(oo) 

= /(ao + t{a2 - ao)) - /(oq) 

= /(ao) + ^[/(a 2 ) - /(oo)] - /(ao) 

= t[f{ao + u)- /(ao)] 

= tp{u) 

for all t E [0,1]. In order to prove the additivity of we note that choosing t = ^ in i) gives 
the equation /(|(ai + os)) = |[/(ai) + /(os)] for all ai,as e A. For arbitrarily chosen 
u,u' eU we obtain therefrom and by I e [0,1] the identity 

(p{u + u') = /(oo + u + u')- /(oo) = / (|([ao + 2 a] + [oq + 2 a'])) - /(oo) 

= |/(ao + 2 a) + \f{ao + 2u) - /(oq) 

= ^p>{2u) + ^p>{2u') = p{l2u) + (p{^2u') = p>{u) + p{u'). 

So p> is additive as well. 
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APPENDIX 


Supplementary Convex Analysis 


Lemma B.l. Let F : M"" ^ M u {+00} be a convex function. 

i) For any two points x,y e dom F and X e M. we have 

F((l - \)x + \y) < (1 - \)F{x) + XF{y) if X e [0,1], (B.l) 

F{{l-X)x + Xy)^{l-X)F{x) + XF{y) z/AeR\(0,l). (B.2) 

ii) If there are three different collinear points a,b,c e dom F which yield the same value 

F{a) = F{b) = F{c) then F is constant on the line segment co({a, b, c}) spanned by 

these three points. 

Proof, i) The inequality (B.l) is just the inequality from the dehnition of convexity. In 
order to prove (B.2) we set 

z\ ■■= X + X{y — x) = {1 — X)x + Xy (B.3) 

for A e M\(0,1). If F{z\) = +00 we clearly have F{z\) = +00 ^ (1 — X)F{x) + XF{y). 
Assume now F{z\) < +00, i.e. ^ domF. In case A ^ 1 rewriting equation (B.3) yields 
the convex combination y = —X^x + ^z\ = (1 — j)x + jZx and hence by the convexity of 
F the inequality F{y) ^ (1 — X)F{x) + jF{z\). Since only hnite values occur this can be 
rewritten as jF{z\) — l)F(a:) + F{y) which is equivalent to the claimed inequality 
in (B.2), since A ^ 1 > 0. In case A < 0 we can similar write x as convex combination 
X = —j^y + = (1 — j^)y + so that the convexity of F yields the inequality 

F{x) < (1 — Yf^)F{y) + y^F( 2 ;a)- Since only hnite values occur this can be rewritten as 
j-^F( 2 ;A) ^ F{x) + j^F{y) which is equivalent to the claimed inequality in (B.2), since 
1 - A ^ 1 > 0. 

ii) Without loss of generality we may assume that b is the point “between” the endpoints 
a and c, so that co{a, b, c} —■ l{a, b) is the line segment between a and c. Set v ■= F(a) = 
F{b) = F{c) E M. We have to show that any ^ e l{a,c) also fulhlls F{z) = v. In case 
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z E l[a, b), we can write 2 ; as convex combination z = {1 — X)a + Xb with some A e [0,1] and 
as affine combination 2 : = (1 —A') 6 +A'c with some A' e M\(0,1), respectively. So inequalities 
(B.l) and (B.2) give F{z) < {1-X)F{a) + XF{b) = v and F{z) ^ (1-A')F(6)+ A'F(c) = n, 
respectively. All in all we thus have F{z) = v. In case 2 ; e l{b,c) = l{c,b) we get the 
assertion analogously by interchanging the roles of a and c. □ 


Of course norms are not strictly convex. However we have the following lemma. 

Lemma B.2. The Euclidean norm || • II 2 : M" ^ M is strictly convex on every straight line, 
which does not contain the origin 0. 


Proof. Let I be a straight line in with 0 f I and let x,y e I he two distinct points. 
The strict Cauchy-Schwarz Inequality {x,y} < ||x|| 2 ||y ||2 holds true for x and y, since these 
vectors are linearly independent. For all A e (0,1) we hence get 

||Aa:+ (1 - X)y\\l = ||Ax ||2 + ||(1 - X)y\\l + 2A(1 - X)(x,y} 

< ||Ax ||2 + 11(1 — X)y\\l + 2A(1 — A)||a:|| 2 ||y ||2 
= (||Aa;||2 + ||(l-A)|/||2f 

and therewith the needed ||Aa; + (1 — X)y \\2 < A||x ||2 + (1 — A)||?/|| 2 . □ 

The following Theorem is obtained from [19, p. 52] and [19, Theorem 7.4]. 

Theorem B.3. Let f : M u {+00} be a proper, convex function. Its closure elf 

fulfills 

i) cl/(a:o) = lim infa.^,j,o f{x) for every XqeR^. 

ii) cl/ is a proper convex and lower semicontinuous function which agrees with f except 
perhaps at relative boundary points o/dom/. 

For the proof of the following theorem see [19, Corollary 7.5.1] 

Theorem B.4. For a function F e Fo(M'^) one has 

F{x*) = limF((l - A)a + Ax*) 

Atl 

for every a e domF and every x* e 

For the proof of the following theorem cf. [19, Theorem 26.1] after identifying aff(domF) 
with some R™'. 
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Theorem B.5. Let F e ro(M") he essentially smooth on aLL^dovei F) —■ A. T/ien 5 (F|a)(x) 
eontains at most one subgradient for every x e M”. In ease x f ri(domF) we have 
d(F|^)(x) = 0 while in ease x e ri(domF) there is exactly one subgradient in d{F\A){x). 
In particular the function F\a is subdifferentiable in every x e ri(domT). 

Lemma B. 6 . Let F :ML ^ Mu{+oo} be a proper and convex function, which is essentially 
smooth on A ■= aff(domF). Then 

i) argmm^g]R„(F(a;) + G(a;)) c ri(domF) for every convex function G : M” Mu {+ 00 } 
with ri(domT) n ri(domG) =|= 0. 

ii) argmiii^gj^n F{x) c ri(domF) and F\a is differentiable in every x e argminF. 


Proof, i) Let all assumptions be fulfilled. By Theorem B.3 we may further assume without 
loss of generality that F is closed, i.e. lower semicontinuous, since replacing F by clF would 
neither affect the assumptions nor the assertions of the theorem. Let x e argmin(F + G). 
Restricting F and G to ^4 = aff(domF) by setting / := F\a and g ■= G\a we still have 
X E argmin(/ + g). Using Theorem B.IO we see that still 

ri(dom/) n ri(dom 5 f) = ri(domF) n ri(domG n yl) = ri(domF) n ri(domG) n A 
= (ri(domF) n yl) n ri(domG) = ri(domF) n ri(domG) =|= 0 . 

Using the therewith applicable Sum rule and Fermat’s rule we obtain 

Oe d{f + g){x) = df{x) + dg{x). 

In particular df{x) =|= 0 so that the essentially smoothness of / gives x e inty 4 (dom/) = 
ri(domF) by Theorem B.5. 

ii) The inclusion follows from the just proven by choosing G = 0 since then ri(domF) n 
ri(domG) = ri(domF) =|= 0 by Theorem B. 8 . From the inclusion we now also get the 
differentiability assertion by applying Theorem B.5. □ 


The proofs of the following two theorems can be found in [19, p. 45]. 

Theorem B.7. Let G be a convex set in M'^. Let x e ri(G) and x e G. Then (1 — X)x + Xx 
belongs to ri(G) (and hence in particular to C) for 0 < A < 1. 

Theorem B. 8 . Let C he any convex set in M”. Then G and ri(G) are convex sets in M"", 
having the same affine hull, and hence the same dimension, as C. In particular ri(G) =|= 0 
tfC^0. 


The following theorem is obtained from [19, Theorem 7.6] and [19, Theorem 6.2]. 
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Theorem B.9. For a proper, convex function T : M” —^ M u {+ 00 } and r e (inf F, + 00 ) 
we have 


ri(levT-F) = ri(lev<T--F) = lev<T-F n ri(domF). 

Furthermore all these sets have the same dimension as domF. 

Theorem B.IO. Let C be a convex set in M"", and let A be an affine set in M” which 
contains a point of Ti{C). Then 


Ti{A r\ C) = Ar\ ri(C'), (B.4) 

~Ar^ = Ar^C, (B.5) 

rb(AnC') = Anrb(C'), (B.6) 

aff(yl n F) = n aff(C). (B.7) 


Proof. For the proof of the hrst and the second equality see [19, Corollary 6.5.1], With 
these statements we now also get 

rh{A Ci C) = A Ci C\ri (74 n C) = (A n C)\{A n ri(C')) = A n (C\ri(C)) = A n rb(C'). 

For the proof of the remaining forth statement let a e ^4 n ri(C'). Since the truth value 
of the assertion stays unchanged when translating the coordinate system we may assume 
a = 0, so that aff(y4) = span(yl), aff(C') = span(C) and aff( 74 n C) = span( 74 n C). Due to 
span(ylnC) = span(yln (span(C') nC)) = span((yln span(C')) nC) and yin span((7) = (yin 
span(C')) n span(C') we may restrict us to subspaces yl c span(C'), so that we can identify 
span(C) with where m = dim(span(C)). Choose £ > 0 so small that c C. Then 
span(yl) = span(yl n B^) c span(yl n ( 7 ) c span(yl) n span((7) = span(yl) n = span(yl) 
so that we have in particular span(yl n (7) = span(yl) n span((7) = A n span((7). □ 

Theorem B.ll. For convex subsets Ci and C 2 ofW^ the following are equivalent: 

i) Cl + C 2 = Cl 0 C 2 , 

ii) aff((7i) + aff(( 72 ) = aff((7i) © aff(( 72 ). 

Proof. Assume without loss of generality that Ci and C 2 are not empty. Translating Ci or 
C 2 does neither change the truth value of the statement Ci + C 2 = Ci ® C 2 nor the truth 
value of the statement aff((7i) + aff(( 72 ) = aff((7i) ©aff(( 72 ). Without loss of generality we 
may therefore assume 0 e ri((7i) and 0 e ri(<72). 

Clearly ii) implies i), since Ci c aff((7i) and C 2 ^ aff(( 72 ). We show the remaining 
direction i) ^ ii) by proving its contrapositive; assume that the sum aff((7i) + aff(( 72 ) 
is not direct, so that there are distinct ai,a'^ e aff((7i) and distinct 02,02 e aff(( 72 ) such 
that Oi + a 2 = a'l + a' 2 . Let a be any of the four points and let C be the corresponding 
set Cl or C 2 . We can hnd a Aa > 0 such that \aO e (7; indeed, by 0 e ri((7) there is 
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an e > 0 such that n aff(C') c C. Hence and since aff(C') is an affine set we get 
Xatt = Xatt + (1 — A)0 e aff(C') n B^ c C* for Xa chosen sufficiently small. For sufficiently 
small chosen A > 0 the four points Ci := Aai, c'^ := Aa'^ and C 2 == Xa 2 , '■= Xa!^ belong 
hence to Ci and C 2 , respectively, and fulfill still Ci + C 2 = c!^ + C 2 under preservation of the 
distinctions ci =|= and C 2 =|= c^. In particular the sum Ci + C 2 is also not direct. □ 

Remark B.12. The condition that Ci and C 2 are convex is essential to guarantee the 
implication Ci + C 2 = Ci ® C 2 ^ aff(C'i) + aff(C 2 ) = aff(Ci) 0 aff(C' 2 ) as the following 
example shows: Consider the sum of the upper circle line Ci ■= {(cos(t), sin(f)) : t e [0,7r]} 
with the line C 2 ■= {(0, A) e : A e M}. We have Ci + C 2 = [—1,1] x M = Ci © ( 72 . 
However aff((7i) = and aff(( 72 ) = C 2 , so that the sum aff((7i) + aff(( 72 ) is clearly not 
direct. 

Lemma B.13. Assume that two nonempty convex sets Ci,C 2 ^ MA give a direct sum 
(7i © ( 72 . Restricting the vector addition + : M"" x M"" ^ M"' to Ci x C 2 gives then a 
homeomorphism between the product space Ci x C 2 and the (topological) subspace (7i © (72 
o/M”. 


Proof. By theorem B.ll we know that the sum of aff(dom (7i) =: Ai and aff (dom C 2 ) —■ A 2 
is also a direct one. Therefore it suffices to show that + \aixA 2 is a homeomorphism between 
Ai X A 2 and Hi ©H 2 . Choose any a* = (a*, a|) e Hi x H 2 and set Xi := Hi — a* and 7^2 := 
H 2 — a 2 . Noting that +|aixA 2 is a homeomorphism between Hi x H 2 and Hi©H 2 if and only 
if / := +|xixa :2 is a homeomorphism between Xi x X 2 and Xi ©X 2 it suffices to prove the 
latter. To this end note that / is clearly continuous and surjective. Since X 1 +X 2 = Xi@X 2 
we see that / is also injective and hence bijective. Finally f~^ : Xi © X 2 ^ 7^i x 7^2 is 
continuous; Let a; = xi + 0:2 e Xi © X 2 and let = x^i ^ + x^^ e Xi © X 2 converge 
to X. We have to show that f~^{x^^^) = {x^^\x^^) converges to f'^^{x) = {xi,X 2 ). By 
Lemma A.2 we know that there exists a constant (7 > 0 such that ||/ii|| < C\\hi + /i 2 || for 
all hi e Xi, ^2 e X 2 . In particular we obtain 

llxi^^ — 2;i|| ^ C\\{x[^'^ — Xi) + (^2^^ — 0 : 2 )II = (7||a;*^^i — a;|| 0 


as k — >■ +00, so that x[^'^ — >■ Xi as h — >■ +00. By role reversal we obtain also x^'^ — >■ X2 as 
k —>■ +00, so that really {x^i \x^'^) —>■ (a;i,a;2) as k ^ +00. □ 


The key in the previous proof was that the directness of the sum of two convex sets (7i, C 2 
keep maintained when enlarging these sets to their affine hull. This is, however, in general 
not true for a direct sum Ci® C 2 , where one of the summands (7i, C 2 is not convex. In 
such cases it can happen that +|c'ixC '2 : (7i x (72 ^ (7i © (72 is no longer a homeomorphism, 
as the following example illustrates: 

Example B.14. Consider the non-convex set Ci ■= {0,1} and the convex set C 2 '■= [0,1). 
Although their sum Ci + C 2 = [0, 2) = ( 7 i ©(72 is a direct one, the sum aff((7i) + aff(( 72 ) = 
M + M is not direct and +|cixC 2 *■5 '>T'Ot a homeomorphism between Ci x C 2 and Ci® C 2 , 
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since these topological spaces are not at all homeomorphic: Ci 0 6*2 = [ 0 , 2 ) is a connected 
space while Ci x 6*2 = [{ 0 } x [ 0 , 1 )] u [{ 1 } x [ 0 , 1 )] is not a connected space. 

Theorem B.15. Let C and A be a convex and an affine subset ofW^, respectively, whose 


sum C + A is direct. Then the following holds true: 

11(^0^) = ^©11(0), (B. 8 ) 

0 C* = 0 C, (^-9) 

rb(A 0 C) = A 0 rb(C'), (B.IO) 

aff(yl ®C) = A® aff(C'). (B.ll) 


Proof. Assume without loss of generality that A and C are not empty. Note hrst that the 
“largest” sum of the four right hand side sums, i.e. the sum A + aff(C') is a direct one by 
Theorem B.ll. Hence the other three sums A + ri(C'), A + C and A + rb(C') are direct 
all the more. Noting that the truth value of the statement aff(A + (7) = A + aff(C') does 
not change when translating A or (7 we may assume 0 e A and 0 e C without loss of 
generality, so that in particular A A + C and C ^ A + C. We then get 

A + aff(C') = aff(A) + aff(C') c aff(A + C) + aff(A + C) = span(A + C) + span(A + C) 
= span(A + ( 7 ) c span(span(A) + span((7)) = span(A) + span((7) = A + aff(C') 

and therewith A 0 aff((7) = span(A 0 (7) = aff(A 0 (7). 

Consider now the topological spaces (7i := aff(A) = A and C 2 ■= aff((7) and their product 
space Cl X C 2 , equipped with the product topology. We have 

intcixC 2 (^ X (7) = intci(A) x intc 2 (C') = A x intc 2 (C'), 

Ax C = A X C = A X C 

and 

dcixC 2 (^ xC) = A7^C^^""'^intclxC2(^ X C') 

= (^A X (7^"^ \ (A X intc 2 (C')) 

= A X (c^^intc,(C)) 

= Axdc,iC). 

By means of the homeomorphism +|cixC 2 : Ci x 6*2 —>■ (7i 0 (72 from lemma B.13 these 
three equations can be translated to 

intci+c 2 (^ + (7) = A + intc 2 (C'), 

= A + 

and 

dc,+c2{A + C) = A + dcAC), 

which gives the equations (B. 8 ), (B.9), (B.IO). □ 
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The following theorem is a special case of [33, Corollary 2.4.5] and an equation used in 
its proof. Cf. also [20, Theorem 10.5]. Note that we need an orthogonal decomposition 
M” = Xi 0 X 2 ■ ■ ■ © Xn in order to guarantee (x, x*') = 


Theorem B.16. Let = Xi©- • ■©X 2 be a decomposition ofW^ into pairwise orthogonal 
vector subspaces Xi,... ,Xn. For any proper functions /j : Xj M u {+00} and their 

n 

semidirect sum / = /i w /2 w ... w /n : M” ^ Mu{+oo}, f{x) = f{xi + - ■ ■ + Xn) ■= 2 


we have 


i) [/l w /2 w . . . w fnY = /f w /2'‘ w . . . w fn, hc. 

n 

f*{x*) = f*{xl + ■ ■ ■ + X*) = 2 fi{x*) for every x* e 

n 

ii) df{x) = df{xi + ■ ■ ■ + Xn) = 0 dfi{xi) for every x e M”. 

i=l 

Proof, i) For any x* = (x*,..., x*) we have 

n n 

f{x*) := sup[<x,x*> - /(x)] = sup ... sup V[<Xi, X*> - /i(Xi)] = .ff (x-). 

XSM" XlSXl XnSXn 

ii) Let X = Xi + ■ ■ ■ + x„ be arbitrarily chosen. In case x* f dom /* for some i the equation 
and the directness of its right-hand side sum holds vacuously true. In case Xj e dom /* for 
all i E {1 ,..., n} the claimed equation also holds true since for any x* = x* -I- ... x* we 
have the equivalences 

X* e df{x) 

= Zi + Z 2 + ■ ■ ■ + '■ f{z) ^ /(x) + {z — X, X*} 

■^'^Z = Zi + Z 2 + ■ ■ ■ + Zn^^^ '■ f{z) — /(x) — {z — X, X*) ^ 0 

n 

^yz = Zi + Z2 + ■■■ + Zn^W^ ■■ ~ ~ ~ ^ ^ 

i=l 

^ Vf e { 1 ,..., n} e Xj : fi{zi) - fi{xi) - (zi - x*, x*} ^ 0 
^ Vf e {1,... ,n} : X* e dfi{xi). 

Finally note that the directness of the sum d/i(xi) © ■ ■ ■ © dfn{xn) is inherited from the 
direct sum Xi © • • • © X„. □ 


As corollary of the previous Theorem B.16 we get the following theorem. 
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Theorem B.17. Let / : M"" ^ M u {+ 00 } be a proper funetion and let dom/ be eontained 
in some affine subset A of ML with difference space U. Then 


for every x e 


df{x) 


0 if X f A 

df\A{x)®U^ ifxeA 


Proof. There is an xq e dom/. Translating the origin of the coordinate system to xq 
through replacing / by /(• — xq) would not affect the truth value of the claimed equation. 
Therefore we may assume xq = 0 without loss of generality, so that A = U is even a 
vector subspace of M”. Setting Xi ■= A = U, X 2 ■= and dehning proper functions 
/i : Xi R u {+ 00 }, f 2 '. X 2 -^Mkj {+ 00 } by 


fl{.Xi) := /Ui(xi) 


and 


f2{x2) ■■= 



if X 2 = 0 
if a;2 =H 0 


allows us to write / in the form /(x) = /(xi + X2) = fi{xi) + f 2 {x 2 ) for all x e R'^. 
Applying Theorem B.16 yields 

df{x) = df{xi + X2) = d/i(xi) © df2{x2) 



0 

if X 2 =1= 0 

5/i(xi)©f/^ 

if X 2 = 0 

0 

ifxfU 

dMx)®U^ 

ii X E U 

0 

if X / A 

df\Aix)®U^ 

if X e A 


for every x e R"^. 


□ 
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APPENDIX 0 


Elaborated details 


Detail 1. The intersection of compact subsets of a non-Hausdorff space does not need to be 
compact again: We will construct an example for this phenomenon in three steps. First we 
will obtain a non-Hausdorff space (X', O') by gluing two copies of the interval ([0,1], [0, 

to an “interval” which has two different right-hand side endpoints 1,1. Next we will 
show that homeomorphic copies of the original spaces are contained in (X', O') as certain 
subspaces (X(,X( fRl O') and (X^ X2 fRl O'). Finally we will show that the intersection 
[X'l n X'2, {X'l n X2) (fil O') of these compact subspaces is homeomorphic to the half-open 
interval ([0,1), [0,1) (fil O^^) and hence not compact. Consider the space 

( ({-i)x[o,i])u({i)x[o,i]) ,xtaoy), 

=:X =-.0 

consisting of two copies { — 1} x [0, 1 ] =: X_i and { 1 } x [0, 1 ] =: Xi of the interval [0, 1 ], 
equipped with the usual topology. In order to glue the space (X, O) to an “interval” with 
two right-hand side endpoints we set 


X' := [0,1)U{1,1}, 


where 1 and 1 are two different elements which are not contained in [0,1); moreover we 
equip X' with the identification topology O' which is induced by O and the mapping f : 
X X' given by 


f{t,a) 


t for t e [0,1) 

< 1 for t = 1 and a = 1 
[i t = 1 and a = —1. 


The space (X', O') is not a Hausdorff space since every O'x-neighborhoods U of 1 has 
nonempty intersection with every O'x -neighborhood U_ of 1 because both U and U_ contain 
infinitely many of the points 1 — n e N. However 1 and 1 are the only distinct points 
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in {X',0') which can not be separated from each other by distinct neighborhoods; i.e. all 
subspaces of{X\0'), which contain at most one of these endpoints, are Hausdorff spaces. 
In particular the sets 


K - f[Xa], 

a e { —1,1}, are Hausdorff spaces. By Lemma 2.3.13 the mapping f\xa: ^ as 

a homeomorphism between {Xa, Xa (Hi O) and (X', X'^ ifil O') for ae{ — 1,1}. In particular 
both X'_^ and X} are compact subsets of {X',0'). However their intersection 

A-:,r,X;-/U,[{l)x[0,l)] 

is homeomorphic to ({1} x [0, 1), ({1} x [0,1)) (fil O), i.e. to ([0, 1), [0, 1) fRl and hence 
not compact. We note here that our construction could also be done in a more elegant 
way if we had used the mapping id[o,i) : [0, 1) —» [0,1) as “Anheftungsabbildung” in order 
to stick two copies of the interval ([0,1], [0,1] fRl together, cf. [If, p. 5f]; however 
this would bring the need to introduce further topological notions. Moreover the constructed 
space (X', O') should be homeomorphic to the space presented by Steen and Seebach in 
section “Telophase Topology” of their book “Counterexamples in Topology”, see [22, p. 92j. 

Detail 2. The intersection of two both compact and closed subsets Ki,K 2 of a topological 
space (X, O) is again closed and compact: Clearly Ki n K 2 is again a closed subset of 
(X, O). Due to 

Kin K2 = Kin (Xi n K2) e Xi fnl ^(X, O) = A(Ki, Ki fnl O) 

the intersection Ki n K 2 is also a closed subset of the compact space {Ki, Ki ifTl O) and 
hence a compact subset of this space by part i) of Theorem 2.1.1. From the compactness of 
the subspace {Ki n K 2 , {Ki n K 2 ) fnl {Ki fnl O)) of {Ki, Ki fRl O) we conclude that 

[Ki n K2, {Ki n K2) fRl {Ki fRl O)) = [Ki n K2, {Ki n K2) fnl O) 

is also a compact subspace of the original space (X, O), since beeing compact is an intrinsic 
property of a topological (sub[space, cf. Definition 1.1.7; i.e. Ki n K 2 is a compact subset 
of{X,0). 

Detail 3. The Definition in [22, p. 7fJ is not totally correct: In that book the right order 
topology for a linearly ordered space (X, <) is said to be the topology which is generated 
by basis sets of the form Sa = {x\x > a). However the whole space X needs in general to 
be added to that set system in order to really obtain a basis for a topology: Consider for 
instance the linearly ordered set (X, <) := ([—00, +00], <). The union of all sets Sa is only 
the set (—00, +00] = 1 = X. Instead of adding the set X to the set system formed by the Sa 
the problem could also be repaired by replacing the word “basis” by “subbasis”. For the left 
order order topology there is the very same problem. It can be repaired analogously. 
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Detail 4. K' ■= {z e Z : z ^ z'} is a compact subset of Let {0[)i^i he some open 

covering of K'. At least one of these open sets, lets call it O', must cover z' and hence also 
every z < z!, i.e. every z e K', by the interval-like structure of the set O' e Taking O' 
already yields the needed finite subcover. 

Detail 5. The equivalences in (*) and (o) in the proof of Theorem 2.5.16 hold true: Note 
that the harder direction “<=” of the equivalence in (*) is true, since every compact set 
K e /C(M”) = /C^(R”) is contained in the closed ballMn^O), if the radius R is chosen large 
enough. The other direction is true since we can simply choose K = ]Br(0). Next 
we proof the equivalence in (o). The totally ordered set (Z,^) := ([—oo,+oo], _,_oq]) 

with the natural order on [—oo, +oo] has both a minimum and a maximum. Hence part 
ii) of Lemma 2.f.lf can be applied and we obtain /C^{+oo}([—oo,+oo], T) = {Z\Lf' : U' e 
W'(+oo) n T}. After taking complements this reads 

Wi+oo) r^T = {Z\K' : if'e/C^{+oo}([-oo,+oo], T)} 
which directly shows that the equivalence in (o) is true. 

Detail 6. / : ^ ([—oo,+oo], T) is continuous at the point oo if and only if 

f : » ([— 00, +oo], C><c) is continuous at the point oo; Let f : — >■ 

([— 00 , +oo], T) be continuous at the point oo, i.e. for any T-neighborhood T of +oo = 
/(oo) there is a neighborhood U e of oo with f[U] c T. In order to show that 

f : (K“,Or) ^ ([ — 00, +oo], is continuous at the point oo let any O^-neighborhood 

O of +00 = /(oo) be given. Since O contains a set of the form {a, +oo] —. T e T we 
obtain with some corresponding neighborhood U e O®" o/oo the inclusion f[U] T O 
and have therewith shown one implication. Let now, to the contrary, f : (R(J,,0^) —>■ 
([— 00, +oo], 0<c) be continuous at the point oo, i.e. for any O^-neighborhood O o/ +oo = 
/(oo) there is a neighborhood U e of co with f[U] c O. In particular the mapping 
/:(R“.Or)^([ —00, +oo],T) is also continuous in +oo, since every T-neighborhood of 
+00 is also a O^-neighborhood o/+oo. 

Detail 7. The product space {¥', O') (8) {Y", O") —■ {Y, O) of two locally compact Hausdorff 
spaces is again a locally compact Hausdorjf space: Let {x', x"), {y', y") be two different 
points in {Y,0) with, say, x' + y'. Since (Y',0') is a Hausdorjf space there exist disjoint 
neighborhoods U' and V of x' and y', respectively. Then clearly U ■= U' x Y" and V ■= 
V X Y" are disjoint neighborhoods of{x',x") and {y',y"), respectively, in (Y,0). So the 
latter topological space is again a Hausdorjf space. Moreover {Y, O) is also locally compact: 
Let y = {y',y") e Y. Since {Y',0') and (Y",0") are locally compact there exist compact 
neighborhoods U' e U'{y') and U" e U"{y"). The neighborhood U ■= U' x Lf" of y is then 
compact in virtue of Tichonov’s Theorem 2.3.6. 

Detail 8. The coercivity assertion of Lemma 2.7.1 is contained in Theorem 3.3.6 as special 
case: Fi and Gi are coercive; for instance Fi = (f>oH\Ti;H*) is a concatenation of the coercive 
mapping f and the injective and hence normcoercive linear mapping H\ti;h*; ; IZ{H*) —> 
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7Z(H); cf. also the proof of Theorem 3.2.1. Moreover the mappings Fi : Xi ^ + 00 ] 

and Gi :Yi —* (— 00 , + 00 ] are lower semicontinuous and hence in particular locally bounded. 
Finally F 2 = 0x2 ^2 = Oyj are clearly bounded below. 

Detail 9. Both (||x„j.||)fcgN and (||ynfc||)fcsN would be bounded above by some B > 0; If one 
of this sequences, say (xn,.) without loss of generality, would be unbounded there would be a 
subsequence (xnfcJjsN with ||xnj,,||x ^ +00 as j —>■ + 00 . Since F is normcoercive we would 

get \\F(xni,.)\\z —^ +°o as j —> + 00 . This contradicts (3.2). 

Detail 10. There is an element b e Z\MAX<c(Z) with K' c b\: //Z\MAX<c(Z) contains 
a maximum b then clearly K' c Z\MAX 5 ;(Z) = b\. If Z\y[AX^{Z) contains no maximum 
then we can write 


Z\MAX^(Z)= y b) 

beZ\MAXs:(Z) 

SO that the sets b), where b e Z\ MAX 5 ;(Z), form in particular an open cover of K'. Due 
to the compactness of K' there are finitely many bi,... ,bn ^ Z\ MAX^(Z) with 

n 

i^l 

Denoting the largest of the bi with b we hence have K' c 

Detail 11. The subspaces Xi + Wi and {Xf- n W^) have trivial intersection: Writing an 
arbitrarily chosen x e (Ai + Wi) n Xf n Wi in the form x = xi + wi with some xi e Xi 
and wi e W\ we get (x,xi) = 0 and (x,wi) = 0. Addition gives {x,x} = 0 and hence 

X = 0. 

Detail 12. For real-valued functions Fi,Fi : Xi —> M u {+ 00 }, F 2 , F 2 : X 2 —> R u {+ 00 } 
with Fi kt) F 2 = Fi kt) F 2 there is a constant C* e R such that Fi = Fi C and F 2 = F 2 — C: 
For all xi e Xi and X 2 e X 2 we have Fi{xi) + F 2 {x 2 ) = Fi{xi) + F 2 {x 2 ). Since only finite 
values occur we can rearrange the latter and obtain Fi{xi) — Fi{xi) = ^ 2 ( 2 ^ 2 )— .^ 2 ( 2 ^ 2 ) for all 
Xi E Xi and X 2 ^ X 2 . In particular the functions Fi—Fi : Ai —> R and F 2 —F 2 : X 2 R are 
constant on Xi and X 2 , respectively; by the previous equality, they take the same constant 
value. Denoting this value by “C” we are done. 

Detail 13. If one of the functions Fi, F 2 , Fi, F 2 takes the value +00 there is no guarantee 
that, e.g. F 2 and F 2 differ merely by a real constant; consider for instance the functions 
Fi = Fi = +00 on Xi. Then Fi w F 2 = Fi w F 2 for any functions F 2 , F 2 : A 2 —» Ru {+ 00 }. 

Detail 14. Both Fi and Gi are bounded below: Let || ■ II 2 be the Euclidean norm in R”. 
After setting 

II • II) == (^1, II • II2U1) {Z, <) := ((-00, +00], <(_oo,+oo]) 
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with the natural ordering <(_oo,+oo] on (—oo,+oo] we can apply Theorem 3.1.7 to Fi : 
{X, II • II) —>■ {Z, <) and obtain that Fi is bounded from below. Likewise we see that also Gi 
is bounded below. 

Detail 15. Without loss of generality, we may assume Xi = Xf, Yi = Yf- and Zi = Zf; 
otherwise we can replace 

Fi by Fi = Fi oTix^^x 2 \xf: 

Gi by Gi = Gi ° T^YiX 2 \Yf^ 

Hi by Hi = Hi oTTzi.z^lz^, 

and continue the proof with theses new functions instead of the original functions due to 
the following three reasons: 

i) The assumptions on Fi,Gi carry over to Fi,Gi: Using part i) of Lemma 3.2.4 
see that the new functions differ from the original functions merely by bijective lin¬ 
ear transformations of their image domains. Since the involved spaces are of finite 
dimension these linear bijections are even homeomorphisms. In particular the locally 
boundedness assumption on the original functions carries over to the new functions. 
Also the coercivity assumption on the original functions carries over to the new func¬ 
tions by part i) of Lemma 3.3.3. 

ii) H stays unchanged when replacing the old function by the new ones: part i) of Lemma 
3.3.3 gives Fi w 0^2 = Fi w 0^2 (ind Gi w 0^2 = Gi w 0^2 so that 

H = (Fi w 0 ^ 2 ) + {Gi w 0 ^ 2 ) 

= (Fi w 0^2) + (Gi 0^2) 

iii) After proving the coercivity of Hi also the coercivity of Hi would follow: Using parts 
ii) and i) of Lemma 3.3.3 we can rewrite H in the form 

F = Fi w 0^2 = ® 0^2 

so that part i) of Lemma 3.3.3 ensures that Hi is coercive iff Hi is coercive. 

Detail 16. F^ is a hyperplane in U = aff(dom^'); The subspace F^ := Hf^ n F is of 
dimension dim F" = dim Hf^ + dim U — dim(F + Hf^) e n — 1 + dim U — {n,n — 1} = 
{dim F, dim F — 1}. The set Hf^ does not completely contain S; consequently F" c 
can not completely contain aff(dom^') ^ S all the more, so that only dimF = 
dim(aff(dom d')) — 1 can be true. Therefore H^ is a hyperplane in aff(domd'). 

Detail 17. For a e (0, |) we have || V 5 'a(- 2^^^)||2 —^ +00 as k ^ +00 for any sequence 
in Q, converging to some boundary point of Q: Since all norms in are 
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equivalent it suffices to show ||V 5 'a(^^^^)||oo —>■ +°C). We haveVgffiz) = —”^( 2 ^ 2 , 
for all z e Q, so that || Vg'Q( 2 ;)||oo = a! 2 :“~^ 2 ; 2 ~^ max{z 2 , zi} for these z. In case = (0, 0)"^ 
we thus have for a e ( 0 , |) the estimate 

||Vfi^„( 2 ;(''))||oo ^ a{max{zf\ 2 ? ^}]“”^[max{ 2 ;f \ 2 :f vaax{zf\zf^] 

= a{max{zf\z^^'^}f^-^ - +00 

as k ^ + 00 . In case =|= (0,0) we may assume, due to symmetry reasons, = 0 and 
^ 2 *^ > 0 without loss of generality. We then obtain 

||V^„( 2 ;(''))||oo = ^ +00 

as k ^ + 00 , even for a e ( 0 , 1 ). 

Detail 18. The functions f and g are hounded from below: If, say f, was not bounded from 
below there would be a sequence {uk)ken in the compact level set levQ,(/) with f{uk) —>■ —00 
for A; —>■ + 00 . However, after choosing a subsequence which converges to some u e levQ,(/) 
we had f{u) = — 00 , by the lower semicontinuity of f. But this would mean that f is not 
proper - a contradicition. 

Detail 19. All assumptions of part Hi) of Lemma f.3.18 are fulfilled for F ■= $, Ui ■= 
Xi ©X 3 , U 2 ■= X 2 and G ■= <.ievT||L-||; •= dl{L*), V 2 ■= M^L), for appropriately chosen 

a and j3: 


• f /2 n V 2 = { 0 } holds true, beeing an assumption of the current theorem. 

• dom Fndom G = dom $nlevT-||L • || =|= 0; Each neighborhood o/O e domF intersects 
doniF. Since r > 0 ensures 0 e int(levT-||L • ||) we thus have in particular for this 
neighborhood 0 =|= domF n int(levT-||L ■ ||) ^ domF n levT-||L • ||. 

• levQ,(F|; 7 j) is nonempty and bounded for an a e R; Denoting the unique minimizer 
of the strictly convex function (f) = $|xi by x and setting a ■■= (f^x) we see that 
levQ,(F|[/J = levQ,(^) © {0} = {x} is nonempty and bounded. 

• Finally lev/ 3 (G|yj) is nonempty and bounded for any /5 ^ 0, since G\v-^ is a norm 

- namely the norm on Vi, which makes (Vi,G|yj) isometrically isomorph to {IZ{L), 
II ■ ||| 72 {l ))7 by virtue of the bijection : IZ{L*) —> IZ{L). 

Detail 20. All assumptions of part Hi) of Lemma f.3.18 are fulfilled for Ui ■= Xi © X^, 
U 2 ■= X 2 , Vi ■■= IZ{L*), V 2 '■= Af{L) and F •■= iargmin(<i>)? •= 11-^ ■ II; appropriate choice 

of a and j3: 

• F, G are in ro(R”) and have the needed translation invariance. 

• 1/2 n V 2 = { 0 } holds true, beeing an assumption of the current theorem. 
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• levQ(F|;7i) is nonempty and bounded for an a: Denoting the unique minimizer of (j) 
with xi we have argmin<l> = {xi} 0 X 2 c Xi 0 X 2 . For a ■= F{xi) = 0 the set 
levQ,(F|c/J = {xi} is then obviously nonempty and bounded. 

• Finally lev/ 3 (G|yJ is nonempty and bounded for any /5 ^ 0, since G\v-^ is a norm 
- namely the norm on Vi, which makes (Vi,G|yj) isometrically isomorph to (TZ{L), 

II ■ ||| 7 e(L ))7 by virtue of the bijection : 1 Z{L*) 71{L). 

Detail 21. d = 0 argmin<l> n A/’(L) =|= 0 : Using Fermat’s Rule, see [19, p. 264, b 
8 ]; 0 e ri(dom$*), see part Hi) in Lemma 4-4-G in order to apply the chain rule, see [19, 
Theorem 23.9j and x e d<l>*(x*) x* e d$(x), see [19, Corollary 23.5.1] we obtain 

d = 0 0 e argmin $*(—L*-) 

^Oed[<l>0-L*-)]|o 
^ 0 e -Ld^*{-L*Q) 

3x e M" : X e d<l>*(0) a 0 = —Lx 
3x e M" : 0 e d$(x) a x e J\f{L) 
argmin $ n A/’(L) =|= 0. 

Detail 22. There is a decomposition aff(domF) = Ap@Pp such that Pp is a subspace of 
P[F] and such that F is strictly convex on int^(domF|^); We set F = $* : —>■ M u 

{+cc}, M{-) =-L*-. Note now that Q e Tifdom. E) r\P{M) and that aS.{dom. E) = Xi@X^, 
where X 3 is a subspace of P[E], by Lemma 4-4-G ond where E = ^* is strictly convex 
on intxi(dom$*|xi) = ri(dom$*|xi), since it is even essentially strictly convex on Xi 
by Lemma 4-4-^- Thus we can use Theorem 4-3.16 and obtain that aff(domF) can be 
decomposed in the claimed way. 

Detail 23. The functions F(-) = $(—F*-) and G{-) = r|| ■ ||* fulfill the assumptions 
of Theorem 4-3.21: Due to 0 = —L*0 e ri(dom<l>*) and dom$* = Xi ® X 3 we see 
that Theorem 4-3.16 can be applied to F = $* and M(-) = -L*-. Thereby we get a 
decomposition aff(domF) = A@P o/aff(domF) =: A into a vector subspace P of the 
periods space P[E] and an affine subspace A c such that F is strictly convex on 
int^(domF|^). Furthermore F is essentially smooth on A by Theorem 4-3.12. 

Detail 24. The assumptions of Theorem 4-3.21 are fulfilled for F = $ and G{-) = X\\L - 
II .■ Clearly F and G are convex functions with ri(domF) n ri(domG) =|= 0. Moreover 
the decomposition aff(domF) = Xi 0 X 2 , or rather their components, have the needed 
properties by our setting’s assumptions: X 2 is a subspace of P[F], F is strictly convex on 
intxi(domF|xi) = ri(domF|xi) =|= 0, and lastly F is essentially smooth on Xy 

Detail 25. / is again proper, convex, lower semicontinuous and essentially smooth: f is 
proper since ^ =■ F is proper and because /(I) = F(x) < + 00 . Moreover f also inherits 
convexity and lower semicontinuity from F. Finally f is essentially smooth: Part ii) of 
Lemma B .6 gives x e argmin F c ri(domF), so that Theorem 4-3.12 can be applied to 
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C. Elaborated details 


E = F and M(-) = -x, giving the essentially smoothness of f = F o M on aff(dom/) = R; 
note for the last equality - in the nontrivial case £ =|= 0 - the above x e ri(domF) and our 
setting assumption 0 e doniF. 

Detail 26. The function g, given by g{T) ■■= ||p||;^ with any p e SOL(Di t-), r e (0, c) 
is well defined, since SOL(Dit-) =|= 0 and since Theorem 4.3.21 ensures ||p||* = ||g||* 
for any other q e SOL(Di^t-); Consider F(-) = $(—L*-) and G(-) = r|| ■ ||*. Due to 
0 = -L*0 e ri(dom<l>*) and dom<l>* = Xi ©X 3 we see that Theorem 4-3.16 can be applied 
to E = ^* and M(-) = -L*-. Thereby we get a decomposition aff(domF) = A@P of 
aff(domF) =: A into a vector subspace P of the periods space P[F] and an affine subspace 
A c R^ such that F is strictly convex on int^(domF|^). We may assume without loss of 
generality that A is a vector subspace as well, since 0 e A. Furthermore F is essentially 
smooth on A by Theorem 4-3.12 and even on A by Lemma 4-3.11. Theorem 4-3.21 can 
thus be applied, giving t||p||* = G{p) = G{q) = T||g||*. Since r =|= 0 we get the claimed 

WPh = 

Detail 27. IIpII^ < d : Theorem 4-3.6 ii) ensures p e SOL(D 2 j|p||^); hence we must 
have IIpII;^ < d since the assumption ||p||h= ^ d would imply, by Theorem 4-4-4; that p e 
SOL(D 2 ,j|p||) c argmin$*(—L*-), resulting inp e SOL(Di^t-) n argmin$*(—L*-). This con¬ 
tradicts the relation SOL{Di r) n argmin $*(—L*-) = 0 from Theorem 4-4-4 which holds 
since t e ( 0 , c). 

Detail 28. The function f, given by /(A) := \\Lx\\ with any x e SOL(P 2 ,a); A e (0,d), is 
well defined, since SOL(P 2 ,a) =1= 0 and since Theorem 4-3.21 ensures \\Lx\\ = \\Lx\\ for any 
other X e SOL(P 2 ,a)-' For F = $ and G(-) = X\\L - 1| all assumptions of Theorem 4-3.21 are 
fulfilled; note herein that F and G are convex functions with ri(domF) n ri(domG) =|= 0 
and that the decomposition aff (dom F) = Xi®X 2 fits to the assumptions of Theorem 4-3.21: 
X 2 is a subspace of P[F] and F is strictly convex on intxi(domF|xi) = ri(domF|xi) =|= 0. 
Lastly F is essentially smooth on Xi. Applying Theorem 4-3.21 gives now A||La;|| = G{x) = 
G{x) = A II Lx II and hence the claimed ||Lx|| = ||Lx||. 

Detail 29. ||Lx|| < c : Theorem 4-2.6 ii) ensures x e SOL(Pijl£||); so we must have ||Lx|| < 
c, since the assumption \\Lx\\ ^ c would imply, by Theorem 4-4-4; that x e SOL(Pij| 2 ,£||) c 
argmin resulting in x e SOL(P 2 ,a) argmin $. This contradicts the relation SOL(P 2 ^a) 
argmind* = 0 from Theorem 4-4-4 which holds since A e (0,d). 

Detail 30. The equations 


SOL(Pi,,) n SOL(Piy) = 0 , 

SOL(P2,a) n SOL(P2 ,a0 = 0 

hold true for all distinct t,t' e (0,c) and all distinct A, A' e (0,d), respectively: If there 
were e.g. distinct A, A' e (0, d) with, say A < A', such that there would be ap e SOL(P 2 ,a) 
SOL(P 2 ,a') we had ||]5||* ^ A < A' and p e argmin d>*(—L*-) subject to || ■ ||* < X', so that p 
would be a local minimizer of ^*{—L*-). Hence, p e argmin d>*(—L*0 by the convexity of 
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$*(— L*-). This, however, contradicts argmin $*(— L*-) n S 0 L(D 2 ,a') = 0, which holds by 
Theorem 4-4-4 since X' e (0, d). The proof of the other equation is done just analogously. 

Detail 31. For an arbitrarily chosen A e (0, d) and X' ■= g{f{X)) we have X = X': Using 
(4.25) and (4.24) with r = /(A) yields 


SOL(P 2 ,a) c SOL(Pij(,)) c SOL(P 2 ,a'), 

SOL(P 2 ,a) c SOL(Pij(,)) c SOL(P 2 ,a') 

Since SOL(P 2 ,a) ^ 0 we must have X = X', in order to avoid a contradiction to (4.30). 

Detail 32. Lemma A. 2 implies inf/j^gXin§i,/i2£X2nSi(^i; ^2) > —1 by the following reason: 
By this lemma there is a constant C ^ 1 > 0 such that ^||/ii ||2 ^ ||^i + 1^2^ = + 

||/i 2||2 + 2(/ii, h0 for all hi e Xi and /i 2 e ^ 2 - For hi e XiCi §1 and /i 2 e ^2 Si we obtain 
in particular 0102 ) ^ = -1 + ^ =: 7 , so i/iai infft^gXinSi,h 2 eX 2 nSi</ii,/i 2 > ^ 

7 > — 1 holds indeed true. 
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Some remarks to the thesis 


Between the preceding thesis and the “vorgelegte Dissertation” there are some minor dif¬ 
ferences. When handing in the “vorgelegte Dissertation” the “Summary” and the “Zusam- 
menfassung” were printed on separate pages outside of the thesis, whereas here they were 
included inside the thesis itself. Moreover Typos, obvious small local errors and certain 
inconsequencies in notation were corrected. In particular the zerovector of the Euclidean 
space M” should now everywhere be denoted by 0 (with exception for n = 1 where the 
notation 0 might be used). 

We hnally note that an electronic version of this work is available via ArXive, see 
http://arxiv.org/a/ciak_r_l 

The reader may want to check this webpage also for Erata / Update (maybe additionally 
containing a new space concept, which was not yet developed enough to be included in the 
“vorgelegte Dissertation”) 
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